In re the Application of

Glaucia PARAHNOS-BACCALA, Florence KOMURIAN- 
PRADEL, Frederic BEDIN, Mireille SODOYER, Catherine 
OTT, Francois MALLET, Herve PERRON and Bernard 
MANDRAND 

Application No.: U.S. National Stage of 
PCT/FR98/01460 

Filed: June 2, 1999 Docket No.: 103514

For: RETROVIRAL NUCLEIC MATERIAL AND NUCLEOTIDE FRAGMENTS, IN 
PARTICULAR ASSOCIATED WITH MULTIPLE SCLEROSIS AND/OR 
RHEUMATOID ARTHRITIS, FOR DIAGNOSTIC, PROPHYLACTIC AND 
THERAPEUTIC USES 

PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D. C. 20231 

Sir: 

Prior to initial examination, please amend the above-identified application as follows: 
IN THE CLAIMS:

Please amend claims 8, 13, 18, 20, 22, 23, 25 and 26 as follows:

Claim 8, line 4, delete "at nucleotide [sic]".
Claim 13, lines 1-2, change "any one of the preceding claims" to --claim 3--.
Claim 18, line 5, change "any one of claims 14 to 17" to --claim 14--.
Claim 20, line 7, change "any one of claims 8 to 11" to --claim 8--.
Claim 22, lines 3-5, change "any one of claims 1 to 7 or a fragment according to any
one of claims 14 to 17" to --claim 1--.
Claim 23, lines 2-3, change "any one of claims 14 to 17" to --claim 14--.
Claim 25, lines 5-6, change "any one of claims 14 to 17" to --claim 14--.
Claim 26, lines 7-8, change "any one of claims 14 to 17" to --claim 14--.






REMARKS 



Claims 1-26 are pending. By this Preliminary Amendment, claim 8 is amended to 
eliminate unnecessary language and claims 13, 18, 20, 22, 23, 25 and 26 are amended to 
eliminate multiple dependency. Prompt and favorable examination is respectfully requested.

WO 99/02666 PCT/FR98/01460

RETROVIRAL NUCLEIC MATERIAL AND NUCLEOTIDE FRAGMENTS,
IN PARTICULAR ASSOCIATED WITH MULTIPLE SCLEROSIS AND/OR
RHEUMATOID ARTHRITIS, FOR DIAGNOSTIC, PROPHYLACTIC AND
THERAPEUTIC USES

Multiple sclerosis (MS) is a demyelinating
disease of the central nervous system (CNS) whose
complete cause still remains unknown.

Numerous studies have supported the hypothesis
of a viral etiology of the disease, but none of the
known viruses tested has proved to be the causative
agent sought: a review of the viruses investigated in
MS over many years has been carried out by E. Norrby and
R.T. Johnson.

Recently, a retrovirus, different from the
known human retroviruses, was isolated from patients
suffering from MS. The authors were able to show that
this retrovirus could be transmitted in vitro, that
patients suffering from MS produced antibodies capable
of recognizing proteins associated with the infection
of the leptomeningeal cells by this retrovirus, and
that the expression of the latter could be greatly
stimulated by the immediate-early genes of some
herpesviruses.

All these results argue in favor of the role in
MS of at least one unknown retrovirus or of a virus
having a reverse transcriptase (RT) activity which is
detectable by the method published by H. Perron and
termed "LM7-type RT" activity.

The studies by the applicant have made it
possible to obtain two continuous cell lines infected
with natural isolates obtained from two different
patients suffering from MS, by a culture method as
described in the document WO-A-93 20188, whose content
is incorporated by reference into the present
description. These two lines derived from cells of
human choroid plexus, called LM7PC and PLI-2, were
deposited at the E.C.A.C.C. on 22 July 1992 and
8 January 1993, respectively, under numbers 92 072201
and 93 010817, in accordance with the provisions of the
Treaty of Budapest. Moreover, the viral isolates 
possessing an LM7-type RT activity have also been 
deposited at the E.C.A.C.C. under the overall name of
"strains". The "strain" or isolate harbored by the
PLI-2 line, called POL-2, was deposited at the
E.C.A.C.C. on 22 July 1992 under No. V92072202. The
"strain" or isolate harbored by the LM7PC line, called
MS7PG, was deposited at the E.C.A.C.C. on 8 January
1993 under No. V93010816.

Using the abovementioned cultures and isolates, 
characterized by biological and morphological criteria, 
efforts were then made to characterize the genetic 
material associated with the viral particles produced
in these cultures.

The portions of the genome already characterized
were used to develop molecular detection tests for the
viral genome and immunoserological tests, using the
amino acid sequences encoded by the nucleotide
sequences of the viral genome, in order to detect the
immune response directed against epitopes associated
with the viral infection and/or expression.

These tools have already made it possible to
confirm an association between MS and the expression of
the sequences identified in the patents cited further
on. However, the viral system discovered by the
applicant is related to a complex retroviral system.
Indeed, the sequences which are found to be
encapsidated in the extracellular viral particles
produced by the different cultures of cells of patients
suffering from MS show clearly that there is
co-encapsidation of retroviral genomes which are related
to but different from the "wild-type" retroviral genome
which produces the infectious viral particles. This
phenomenon was observed between replicative
retroviruses and endogenous retroviruses belonging to
the same family, or even heterologous retroviruses. The
concept of endogenous retrovirus is very important in
the context of our discovery because, in the case of
MSRV-1, it has been observed that endogenous retroviral
sequences comprising sequences homologous to the MSRV-1 
genome exist in normal human DNA. The existence of
endogenous retroviral elements (ERV) related to MSRV-1
through all or part of their genome explains the fact
that the expression of the MSRV-1 retrovirus in human
cells can interact with related endogenous sequences.
These interactions are found in the case of pathogenic
and/or infectious endogenous retroviruses (for example
some ecotropic strains of the Murine Leukemia virus),
and in the case of exogenous retroviruses whose nucleotide
sequence may be found partially or completely in the
form of ERVs in the genome of the host animal (e.g.
mouse mammary tumor exogenous virus transmitted via
milk). These interactions consist mainly of (i) a
transactivation or co-activation of ERVs by the
replicative retrovirus, (ii) an "illegitimate"
encapsidation of related RNAs of ERVs, or of ERVs - or
even of cellular RNAs - simply possessing compatible
encapsidation sequences, into the retroviral particles
produced by the expression of the replicative strain,
which are sometimes transmissible and sometimes with an
inherent pathogenicity, and (iii) relatively frequent
recombinations between the co-encapsidated genomes, in
particular in the reverse transcription phases, which
lead to the formation of hybrid genomes, which are
sometimes transmissible and sometimes with an inherent
pathogenicity.

Thus, (i) various MSRV-1-related sequences have
been found in purified viral particles; (ii) molecular
analysis of the various regions of the MSRV-1
retroviral genome should be carried out by
systematically analyzing the co-encapsidated,
interfering and/or recombinant sequences which are
generated by the infection and/or expression of MSRV-1;
furthermore, some clones may have portions of defective
sequences produced by the retroviral replication and
the template and/or transcription errors caused by
reverse transcriptase; (iii) the families of sequences
related to the same retroviral genomic region are the
supports for an overall diagnostic detection which may
be optimized by the identification of invariable
regions among the clones expressed and by the
identification of reading frames responsible for the
production of antigenic and/or pathogenic polypeptides
which may only be produced by a portion, or even only
one, of the clones expressed and, under these
conditions, the systematic analysis of the clones
expressed in one region of a given gene makes it
possible to evaluate the frequency of variation and/or
recombination of the MSRV-1 genome in this region and
to define the optimum sequences for the applications,
in particular the diagnostic applications; (iv) the
pathology caused by a retrovirus such as MSRV-1 may be
a direct effect of its expression and of the proteins
or peptides produced as a result, but also an effect of
the activation, encapsidation, recombination of related
or heterologous genomes and proteins or peptides
produced as a result; thus, these genomes associated
with the expression and/or infection by MSRV-1 are an
integral part of the potential pathogenicity of this
virus and therefore constitute diagnostic detection
supports and particular therapeutic targets. Likewise,
any agent which is associated with, or which is a
cofactor for these interactions responsible for the
pathogenicity in question, such as MSRV-2 or the
gliotoxic factor described in the patent application
published under the No. FR-2,716,198, can participate
in the development of an overall and very effective
strategy for therapeutic diagnosis, prognosis,
monitoring and/or integrated therapy for MS in
particular, but also for any other disease associated
with the same agents.
In this context, a parallel discovery has been
made in another autoimmune disease, rheumatoid
arthritis (RA), which has been described in the French
patent application filed under the No. 95 02960. This
discovery shows that, by applying methodological
approaches similar to those which were used in the
studies by the applicant on MS, it has been possible to
identify a retrovirus expressed in RA which shares the
sequences described for MSRV-1 in MS and also the
coexistence of an MSRV-2-associated sequence which is
also described in MS. As regards MSRV-1, the sequences
commonly detected in MS and RA relate to the pol and
gag genes. On the basis of current knowledge, it is
possible to combine the gag and pol sequences described
with the MSRV-1 strains expressed in these two
diseases.

The present patent application has as its
object various results, supplementary in relation to
those already protected by the French patent
applications:

- Patent Application WO 97/06260.

The present invention relates, first of all, to 
a nucleic material, which may consist of a retroviral 
material, in isolated or purified state, which may be 
understood or characterized in various ways:

- it comprises a nucleotide sequence chosen
from the group which consists of (i) the sequences
SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117,
SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130,
SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences
complementary to sequences (i); and (iii) the sequences
equivalent to sequences (i) or (ii), in particular the
sequences having, for every series of 100 contiguous
monomers, at least 50%, and preferentially at least 70%
homology with sequences (i) or (ii) respectively;

- it encodes a polypeptide having, for every
contiguous series of at least 30 amino acids, at least
50%, and preferably at least 70% homology with a
peptide sequence chosen from the group which consists
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118,
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137;

- its pol gene comprises a nucleotide sequence
identical or equivalent to a sequence chosen from the
group which consists of SEQ ID NO: 112, SEQ ID NO: 124
and their complementary sequences;

- the 5' end of its pol gene starts at
nucleotide 1419 of SEQ ID NO: 130;

- its pol gene encodes a polypeptide having,
for every contiguous series of at least 30 amino acids,
at least 50%, and preferably at least 70% homology with
the peptide sequence SEQ ID NO: 113;

- the 3' end of its gag gene ends at nucleotide
1418 of SEQ ID NO: 130;

- its env gene comprises a nucleotide sequence
identical or equivalent to a sequence chosen from the
group which consists of SEQ ID NO: 117, and its
complementary sequences;

- its env gene comprises a nucleotide sequence
which starts at nucleotide 1 of SEQ ID NO: 117 and ends
at nucleotide 233 of SEQ ID NO: 114;

- its env gene encodes a polypeptide having,
for every contiguous series of at least 30 amino acids,
at least 50%, and preferably at least 70% homology with
the sequence SEQ ID NO: 118;

- the U3R region of its 3' LTR comprises a
nucleotide sequence which ends at nucleotide 617 of
SEQ ID NO: 114;

- the RU5 region of its 5' LTR comprises a
nucleotide sequence which starts at nucleotide 755 of
SEQ ID NO: 120 and ends at nucleotide 337 of SEQ ID NO:
141 or SEQ ID NO: 142;

- a retroviral nucleic material comprising a
sequence which starts at nucleotide 755 of SEQ ID NO:
120 and which ends at nucleotide 617 of SEQ ID NO: 114;

- the retroviral nucleic material as defined
above is in particular associated with at least one
autoimmune disease such as multiple sclerosis or
rheumatoid arthritis.
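
The characterizations above refer to sequences complementary to the listed sequences. As a purely illustrative sketch, which is not part of the original disclosure, the following Python function computes such a complement under the assumption of standard Watson-Crick pairing for DNA and the usual convention that the complementary strand is written 5' to 3' (the reverse complement). The function name and the short example sequence are hypothetical and do not correspond to any SEQ ID NO.

```python
# Illustrative only: Watson-Crick pairing for DNA bases (assumption).
_PAIRS = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of a DNA sequence, read 5' to 3'."""
    return seq.translate(_PAIRS)[::-1]


# Hypothetical example sequence, not taken from the sequence listing.
example = "ATGGCCTA"
print(reverse_complement(example))
```

Applying the function twice returns the starting sequence, which is a convenient sanity check on any complement computed this way.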

The invention also relates to a nucleotide 
fragment which corresponds to at least one of the 
following definitions: 

- it comprises or consists of a nucleotide
sequence chosen from the group which consists of (i)
the sequences SEQ ID NO: 112, SEQ ID NO: 114,
SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID NO: 124,
SEQ ID NO: 130, SEQ ID NO: 141 and SEQ ID NO: 142; (ii)
the sequences complementary to sequences (i); and (iii)
the sequences equivalent to sequences (i) or (ii), in
particular the sequences having, for every series of
100 contiguous monomers, at least 50%, and
preferentially at least 70% homology with sequences (i)
or (ii) respectively;

- it comprises or consists of a nucleotide
sequence encoding a polypeptide having, for every
contiguous series of at least 30 amino acids, at least
50%, and preferably at least 70% homology with a
peptide sequence chosen from the group which consists
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118,
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137.
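
The criterion "for every series of 100 contiguous monomers, at least 50%, and preferentially at least 70% homology" can be pictured as a sliding-window check. The sketch below is one possible reading only, under simplifying assumptions that the description does not state: the two sequences are already aligned, have equal length and contain no gaps, and homology is taken as the percentage of identical positions. The function names, default values and test sequences are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def meets_window_criterion(candidate: str, reference: str,
                           window: int = 100,
                           threshold: float = 50.0) -> bool:
    """True if every run of `window` contiguous monomers reaches `threshold`.

    Assumes a gap-free, position-by-position alignment (illustrative only).
    """
    if len(candidate) != len(reference) or len(candidate) < window:
        raise ValueError("need equal-length sequences at least one window long")
    return all(
        percent_identity(candidate[i:i + window],
                         reference[i:i + window]) >= threshold
        for i in range(len(candidate) - window + 1)
    )


# Hypothetical 200-monomer sequences, not taken from the sequence listing.
reference = "ACGT" * 50
mutated = "T" * 60 + reference[60:]
print(meets_window_criterion(mutated, reference, threshold=50.0))
print(meets_window_criterion(mutated, reference, threshold=70.0))
```

The same check applies to peptide sequences by setting `window=30`, in line with the "contiguous series of at least 30 amino acids" wording. A real comparison would normally start from a proper alignment that allows insertions and deletions.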

Other subjects of the present invention are the 
following:

- a nucleic probe for the detection of a
retrovirus associated with multiple sclerosis and/or
rheumatoid arthritis, capable of hybridizing
specifically with any fragment defined above and
belonging to the genome of said retrovirus; it
advantageously possesses from 10 to 100 nucleotides,
preferably from 10 to 30 nucleotides;

- a primer for the amplification, by
polymerization, of an RNA or of a DNA of a retrovirus
associated with multiple sclerosis and/or rheumatoid
arthritis, which comprises a nucleotide sequence
identical or equivalent to at least a portion of the
nucleotide sequence of a fragment defined above, in
particular a nucleotide sequence having, for every
series of 10 contiguous monomers, at least 50%,
preferably at least 70% homology with at least said
portion of said fragment; preferably the nucleotide
sequence of a primer of the invention is chosen from
SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122,
SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127,
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, and
SEQ ID NO: 133;

- an RNA or a DNA, and in particular a
replication and/or expression vector, comprising a
genomic fragment of the nucleic material or a fragment
defined above;

- a peptide encoded by any open reading frame
belonging to a nucleotide fragment defined above, in
particular a polypeptide, for example an oligopeptide
forming or comprising an antigenic determinant
recognized by sera of patients infected with the MSRV-1
virus, and/or in whom the MSRV-1 virus has been
reactivated; a preferential peptide comprises a
sequence identical, partially or completely, or
equivalent to a sequence chosen from SEQ ID NO: 113,
SEQ ID NO: 115, SEQ ID NO: 118, SEQ ID NO: 121,
SEQ ID NO: 135 and SEQ ID NO: 137;

- a diagnostic, prophylactic or therapeutic
composition, in particular for inhibiting the
expression of at least one retrovirus associated with
multiple sclerosis and/or rheumatoid arthritis,
comprising a nucleotide fragment defined above;

- a method for detecting a retrovirus
associated with multiple sclerosis and/or rheumatoid
arthritis, in a biological sample, comprising the steps
consisting of bringing an RNA and/or a DNA assumed to
belong to or obtained from said retrovirus, or their
complementary RNA and/or DNA, into contact with a
composition comprising a nucleotide fragment defined
above.

Before detailing the invention, various terms 
used in the description and the claims are now defined: 

- strain or isolate is understood to mean any
infectious and/or pathogenic biological fraction
containing, for example, viruses and/or bacteria and/or
parasites, generating a pathogenic and/or antigenic
power, harbored by a culture or a live host; by way of
example, a viral strain according to the preceding
definition may contain a co-infectious agent, for
example a pathogenic protist,

- the term "MSRV" used in the present
description designates any pathogenic and/or infectious
agent, as associated with MS, in particular a viral
species, the attenuated strains of said viral species,
or the interfering defective particles or particles
containing co-encapsidated genomes or alternatively
genomes recombined with a portion of the MSRV-1 genome,
which are derived from this species. It is known that
viruses and particularly viruses containing RNA exhibit
variability, following in particular relatively high
rates of spontaneous mutation, which will be taken into
account below to define the concept of equivalence,

- human virus is understood to mean a virus
capable of infecting or of being harbored by human
beings,

- given all the natural or induced variations
and/or recombination which may be encountered in
practice in the present invention, the objects thereof,
defined above and in the claims, have been expressed so
as to comprise the equivalents or derivatives of the
various biological materials defined below, in
particular homologous nucleotide or peptide sequences,
- the variant of a virus or of a pathogenic
and/or infectious agent according to the invention
comprises at least one antigen recognized by at least
one antibody directed against at least one
corresponding antigen of said virus and/or of said
pathogenic and/or infectious agent, and/or a genome in
which any portion is detected by at least one
hybridization probe, and/or at least one nucleotide
amplification primer specific for said virus and/or
pathogenic and/or infectious agent, under defined
hybridization conditions well known to persons skilled
in the art,

- according to the invention, a nucleotide
fragment or an oligonucleotide or a polynucleotide is a
stretch of monomers, or a biopolymer, characterized by
the informational sequence of the natural nucleic
acids, which is capable of hybridizing to any other
nucleotide fragment under predefined conditions, it
being possible for the stretch to contain monomers of
different chemical structures and to be obtained from a
natural nucleic acid molecule and/or by genetic
recombination and/or by chemical synthesis; a
nucleotide fragment may be identical to a genomic
fragment of the MSRV-1 virus considered by the present
invention, in particular a gene of the latter, for
example pol or env in the case of said virus;

- thus, a monomer may be a natural nucleic acid
nucleotide in which the constituent components are a
sugar, a phosphate group and a nitrogen base; in RNA,
the sugar is ribose; in DNA, the sugar is
2-deoxyribose; depending on whether DNA or RNA is involved,
the nitrogen base is chosen from adenine, guanine,
uracil, cytosine, thymine; or the nucleotide may be
modified in at least one of the three constituent
components; by way of example, the modification may
occur at the level of the bases, generating modified
bases such as inosine, 5-methyldeoxycytidine,
deoxyuridine, 5-dimethylaminodeoxyuridine,
2,6-diaminopurine, 5-bromodeoxyuridine and any
other modified base promoting hybridization; at the
level of the sugar, the modification may consist in the
replacement of at least one deoxyribose with a
polyamide, and at the level of the phosphate group, the
modification may consist in its replacement with
esters, in particular chosen from the esters of
diphosphate, of alkyl and arylphosphonate and of
phosphorothioate,

- "informational sequence" is understood to
mean any ordered series of monomers whose chemical
nature and order in a reference direction may or
may not constitute functional information of the
same quality as that of the natural nucleic acids,

- hybridization is understood to mean the
process during which, under appropriate operating
conditions, two nucleotide fragments, having
sufficiently complementary sequences, become annealed
to form a complex, in particular a double or triple
structure, preferably in helical form,

- a probe comprises a nucleotide fragment
synthesized by the chemical route or obtained by
digestion or enzymatic cleavage of a longer nucleotide
fragment, comprising at least six monomers,
advantageously from 10 to 100 monomers, preferably 10
to 30 monomers, and possessing a hybridization
specificity under defined conditions; preferably, a
probe possessing fewer than 10 monomers is not used
alone, but is used in the presence of other probes
which are equally short in length or otherwise; under
certain specific conditions, it may be useful to use
probes which are greater than 100 monomers in size; a
probe may be used in particular for diagnostic
purposes, and it may be, for example, a capture and/or
detection probe,

- the capture probe may be immobilized on a
solid support by any appropriate means, that is to say
directly or indirectly, for example by covalent bonding
or passive adsorption,

- the detection probe may be labeled by means
of a marker chosen in particular from radioactive
isotopes, enzymes chosen in particular from peroxidase
and alkaline phosphatase and those capable of
hydrolyzing a chromogenic, fluorogenic or luminescent
substrate, chromophoric chemical compounds,
chromogenic, fluorogenic or luminescent compounds,
analogs of nucleotide bases, and biotin,

- the probes used for diagnostic purposes of
the invention may be used in all known hybridization
techniques, and in particular the so-called "DOT-BLOT"
technique, "SOUTHERN BLOT" technique, "NORTHERN BLOT"
technique, which is a technique identical to the
"SOUTHERN BLOT" technique but which uses RNA as target,
and the SANDWICH technique; advantageously, the SANDWICH
technique is used in the present invention, comprising
a specific capture probe and/or a specific detection
probe, it being understood that the capture probe and
the detection probe must have a nucleotide sequence
which is at least partially different,

- any probe according to the present invention
may hybridize in vivo or in vitro with the RNA and/or
with the DNA, in order to block the replication, in
particular translation and/or transcription, phenomena
and/or to degrade said DNA and/or RNA,

- a primer is a probe comprising at least six
monomers, and advantageously from 10 to 30 monomers,
possessing hybridization specificity under defined
conditions, for the initiation of an enzymatic
polymerization, for example in an amplification
technique such as PCR (Polymerase Chain Reaction), in
an extension method such as sequencing, in a reverse
transcription method and the like,

- two nucleotide or peptide sequences are said
to be equivalent or derived with respect to each other,
or with respect to a reference sequence, if
functionally the corresponding biopolymers can play
substantially the same role, without being identical,
in relation to the application or use considered, or in
the technique in which they are involved; particularly
equivalent are two sequences obtained because of the
natural variability, in particular spontaneous
mutation, of the species from which they were
identified, or induced mutation, as well as two
homologous sequences, the homology being defined below,

- "variability" is understood to mean any
spontaneous or induced modification of a sequence, in
particular by substitution, and/or insertion, and/or
deletion of nucleotides and/or of nucleotide fragments,
and/or extension and/or shortening of the sequence at
least at one of the ends; a nonnatural variability may
result from the genetic engineering techniques used,
for example from the choice of the degenerate or
nondegenerate synthetic primers selected to amplify a
nucleic acid; this variability may result in
modifications of any starting sequence, considered as a
reference, and which may be expressed by a degree of
homology with respect to said reference sequence,

- homology characterizes the degree of identity
of two compared nucleotide or peptide fragments; it is
measured by the percentage identity which is in
particular determined by direct comparison of
nucleotide or peptide sequences, with respect to
reference nucleotide or peptide sequences,

- any nucleotide fragment is said to be
equivalent to or derived from a reference fragment if
it has a nucleotide sequence equivalent to the sequence
of the reference fragment; according to the preceding
definition, in particular equivalent to a reference
nucleotide fragment are:

(a) any fragment capable of hybridizing, at
least partially, with the complement of the
reference fragment,

(b) any fragment whose alignment with the
reference fragment leads to the identification of
identical contiguous bases, in a greater number than
with any other fragment obtained from another taxonomic
group,

(c) any fragment resulting or capable of
resulting from the natural variability of the species
from which it is obtained,

(d) any fragment which may result from genetic
engineering techniques applied to the reference
fragment,

(e) any fragment, containing at least eight
contiguous nucleotides, encoding a peptide homologous
or identical to the peptide encoded by the reference
fragment,

(f) any fragment different from the reference
fragment through insertion, deletion, substitution of
at least one monomer, extension, or shortening at least
at one of its ends; for example, any fragment
corresponding to the reference fragment, flanked at
least at one of its ends by a nucleotide sequence not
encoding a polypeptide,

- polypeptide is understood to mean in
particular any peptide of at least two amino acids, in
particular oligopeptide, protein, extracted, separated,
or substantially isolated or synthesized, through the
involvement of humans, in particular those obtained by
chemical synthesis, or through expression in a
recombinant organism,

- polypeptide partially encoded by a nucleotide
fragment is understood to mean a polypeptide having at
least three amino acids encoded by at least nine
contiguous monomers included in said nucleotide
fragment,

- an amino acid is said to be analogous to
another amino acid when their respective
physicochemical characteristics, such as polarity,
hydrophobicity and/or basicity, and/or acidity, and/or
neutrality, are substantially the same; thus, a leucine
is analogous to an isoleucine,

- any polypeptide is said to be equivalent to
or derived from a reference polypeptide if the
polypeptides compared have substantially the same
properties, and in particular the same antigenic,
immunological, enzymatic and/or molecular recognition
properties; in particular equivalent to a reference
polypeptide is:

(a) any polypeptide possessing a sequence in
which at least one amino acid has been replaced by an
analogous amino acid,

(b) any polypeptide having an equivalent
peptide sequence, obtained by natural or induced
variation of said reference polypeptide, and/or of the
nucleotide fragment encoding said polypeptide,

(c) a mimotope of said reference polypeptide,

(d) any polypeptide in whose sequence one or
more amino acids of the L series are replaced by an
amino acid of the D series, and vice versa,

(e) any polypeptide into whose sequence a
modification of the side chains of the amino acids has
been introduced, such as for example an acetylation of
the amine-containing functions, a carboxylation of the
thiol functions, an esterification of the carboxyl
functions,

(f) any polypeptide in whose sequence one or
more peptide bonds have been modified, such as for
example the carba, retro, inverso, retro-inverso,
reduced, and methylene-oxy bonds,

(g) any polypeptide in which at least one
antigen is recognized by an antibody directed against a
reference polypeptide,

- the percentage identity characterizing the
homology between two peptide fragments compared is
according to the present invention at least 50% and
preferably at least 70%.
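
The definition given above of a polypeptide partially encoded by a nucleotide fragment (at least three amino acids encoded by at least nine contiguous monomers) can be illustrated by translating a fragment in its three forward reading frames. The following Python sketch is an illustration only and is not part of the original disclosure: it assumes the standard genetic code, considers the forward strand only, and marks stop codons with `*` and unrecognized codons with `X`. All function names and the example fragment are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str, frame: int = 0) -> str:
    """Translate a DNA or RNA string starting at offset `frame` (0, 1 or 2)."""
    seq = seq.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frames(seq: str) -> list[str]:
    """Return the translations of the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


def partially_encodes(peptide: str, fragment: str, min_len: int = 3) -> bool:
    """True if some run of `min_len` residues of `peptide` is encoded by
    contiguous monomers of `fragment` in one of its forward frames."""
    frames = three_frames(fragment)
    return any(
        peptide[i:i + min_len] in frame
        for frame in frames
        for i in range(len(peptide) - min_len + 1)
    )


# Hypothetical fragment, not taken from the sequence listing.
fragment = "ATGAAAGTTTGA"
print(three_frames(fragment))
print(partially_encodes("GMKVG", fragment))
```

With the default `min_len=3`, a match corresponds to three amino acids read from nine contiguous monomers of the fragment. Searching the complementary strand as well would require translating its three frames too.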

Given that a virus possessing a reverse
transcriptase enzymatic activity may be genetically
characterized both in RNA and DNA form, both the viral
DNA and RNA will be mentioned in order to characterize
the sequences relative to a virus possessing such a
reverse transcriptase activity, termed MSRV-1 according
to the present description.

The expressions of order which are used in the 
present description and the claims, such as "first 
nucleotide sequence", are not selected to express a 
particular order, but to define the invention more
clearly.

Detection of a substance or agent is understood
below to mean an identification, a quantification or a
separation or isolation of said substance or of said
agent.

The invention will be understood more clearly
on reading the detailed description which follows, which
is made with reference to the appended figures in
which:

Figure 1 represents the general structure of 
20 the proviral DNA and the genomic RNA of MSRV-1. 

Figure 2 represents the nucleotide sequence of 
the clone called CL6-5' (SEQ ID NO: 112) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 
25 Figure 3 represents the nucleotide sequence of 

the clone called CL6-3' (SEQ ID NO: 114) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 4 represents the nucleotide sequence of 
3 0 the clone called C15 (SEQ ID NO: 117) and three 
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 5 represents the nucleotide sequence of 
the clone called 5M6 (SEQ ID NO: 120) and three 
potential reading frames in amino acids presented under
the nucleotide sequence. 

Figure 6 represents the nucleotide sequence of 
the clone called CL2 (SEQ ID NO: 130) and three
potential reading frames in amino acids presented under
the nucleotide sequence. 

Figure 7 represents three potential reading 
frames in amino acids expressed by pET28C-clone 2 and 
presented under the nucleotide sequence.

Figure 8 represents three potential reading 
frames in amino acids expressed by pET21C-clone 2 and 
presented under the nucleotide sequence. 

Figure 9 represents the nucleotide sequence of
the clone called LB13 (SEQ ID NO: 141) and three
potential reading frames in amino acids presented under 
the nucleotide sequence. 

Figure 10 represents the nucleotide sequence of 
the clone called LA15 (SEQ ID NO: 142) and three
potential reading frames in amino acids presented under
the nucleotide sequence. 

Figure 11 represents the nucleotide sequence of 
the clone called LB16 (SEQ ID NO: 124) and three
potential reading frames in amino acids presented under
the nucleotide sequence.

Figure 12 represents the promoter activity,
expressed in cpm/4 min, of the U3R sequences subcloned
from LTRs of different origins into the plasmid PCAT3.
PCAT3 means plasmid alone, PCAT-PH74 means plasmid plus
endogenous U3R clone expressed in the placenta, PCAT-cl6
means plasmid plus U3R clone amplified in the RNA
of an MS plasma, PCAT-5M6 means plasmid plus U3R region
amplified in the cellular DNA, "no plasmid" means
absence of plasmid in the test.

Figure 13 represents the MSRV1 env and 3' LTR
sequences. The horizontal arrows indicate the start of
the env, U3 and R regions. In the env region, the
signal peptide and the potential immunosuppressive
region are underlined, the potential glycosylation
sites are boxed and the potential cleavage sites are
indicated by vertical arrows. In the U3R region, the
regulatory element CAAT and the TATA box are
underlined; the "cap" site and the polyadenylation
signal are also indicated.

Figure 14 represents the 5' LTR (RU5) region
followed by a PBS site (primer binding site)
complementary to the Trp tRNA and by a gag gene
encoding a protein of about 487 amino acids. The amino
acids conserved in the nucleocapsid are underlined
twice. The amino acids defining the region of greatest
homology in the capsid are in bold and underlined once.
The / symbols in the amino acid sequence indicate
variations observed depending on the clones and, in the
nucleotide sequence, they indicate frame jumps in some
clones. The boxed regions correspond to epitopes
identified by peptide analysis of the C-terminal
region.

Figure 15 represents the integrase region of
MSRV1, the nucleotide sequence and the amino acid
sequence deduced from the integrase region
corresponding to clone 87-23. In Figure 15, // means a
frame jump which has been suppressed in order to
restore the potential ORF. The letters in underlined
bold characters represent the conserved amino acids in
the retroviral integrases.

Figure 16 describes the nucleotide and peptide
sequences of clone B13 (identical to clone FBd13
described in previous applications) with indication of
the ORFs and stop codons represented by a dot. The
underlined region in bold represents the potential
immunosuppressive domain. The single underlined domain
represents the start of the 3' LTR.

EXAMPLE 1: PREPARATION OF A CL6-5' REGION ENCODING THE
N-TERMINAL END OF INTEGRASE AND OF A CL6-3' REGION
CONTAINING THE 3' TERMINAL SEQUENCE OF THE MSRV-1
GENOME

A 3' RACE was carried out on the total RNA
extracted from plasma from a patient suffering from MS.
A healthy control plasma, treated under the same
conditions, was used as negative control. The synthesis
of cDNA was carried out with an oligo dT primer
identified by SEQ ID NO: 68 (5' GAC TCG CTG CAG ATC GAT
TTT TTT TTT TTT TTT T 3') and the reverse transcriptase
"Expand™ RT" from Boehringer according to the
conditions recommended by the company. A PCR was
carried out with the enzyme Klentaq (Clontech) under
the following conditions: 94°C 5 min then 93°C 1 min,
58°C 1 min, 68°C 3 min over 40 cycles and 68°C for
8 min, with a final reaction volume of 50 µl.
Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 69
5' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3';

- 3' primer, identified by SEQ ID NO: 68
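
As a worked check of the cycling program stated above (94°C 5 min, then 40 cycles of 93°C 1 min, 58°C 1 min, 68°C 3 min, then 68°C 8 min), the following minimal sketch adds up the programmed hold times. It counts hold times only; ramping between temperatures is instrument dependent and is not given in the description. The function name `program_minutes` is hypothetical.

```python
def program_minutes(initial: float, cycle_steps: tuple, cycles: int, final: float) -> float:
    """Total programmed hold time in minutes.

    Assumption: only the stated hold times are summed; temperature
    ramping is ignored because the text gives no ramp rates.
    """
    return initial + cycles * sum(cycle_steps) + final


# First PCR of Example 1: 5 min, 40 x (1 + 1 + 3) min, 8 min.
first_pcr = program_minutes(initial=5, cycle_steps=(1, 1, 3), cycles=40, final=8)
print(first_pcr)
```

The same helper applies to the other programs in the examples by changing the step durations and the cycle count.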

A second so-called "seminested" PCR was carried
out with a 5' primer situated inside the region already
amplified. This second PCR was carried out under the
same experimental conditions as those used for the
first PCR, using 10 µl of the amplification product
derived from the first PCR.
Primers used for the seminested PCR: 

- 5' primer, identified by SEQ ID NO: 70
5' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3';

- 3' primer, identified by SEQ ID NO: 68

The primers SEQ ID NO: 69 and SEQ ID NO: 70 are 
specific for the pol region of MSRV-1.

An amplification product of 1.9 kb was obtained
for the plasma of the MS patient. The corresponding
fragment was not observed for the healthy control
plasma. This amplification product was cloned in the
following manner:

The amplified DNA was inserted into a plasmid with the
aid of the TA Cloning kit®. The 2 µl of DNA solution
were mixed with 5 µl of sterile distilled water, 1 µl
of a 10 times concentrated ligation buffer "10X
LIGATION BUFFER", 2 µl of "pCR™ VECTOR" (25 ng/ml) and
1 µl of "T4 DNA LIGASE". This mixture was incubated
overnight at 12°C. The next steps were carried out in
accordance with the instructions for the TA Cloning
kit® (Invitrogen). At the end of the procedure, the
white colonies of recombinant bacteria were
subcultured to allow the extraction of the incorporated
plasmids according to the so-called "miniprep"
procedure. The plasmid preparation of each recombinant
colony was cut with an appropriate restriction enzyme
and analyzed on agarose gel. The plasmids possessing an
insert detected under UV light after staining the gel
with ethidium bromide were selected for the sequencing
of the insert after hybridization with a primer
complementary to the Sp6 promoter present on the
cloning plasmid of the TA Cloning kit®. The reaction
prior to the sequencing was then carried out according
to the method recommended for using the sequencing kit
"PRISM™ Ready Reaction AmpliTaq® FS, DyeDeoxy™
Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied
Biosystems 373 A and 377 apparatus, according to the
manufacturer's instructions.

The clone obtained contains a CL6-5' region
encoding the N-terminal end of integrase and a CL6-3'
region corresponding to the 3' terminal region of
MSRV-1 and making it possible to define the end of the
envelope (234 bp) and the U3 and R (401 bp) regions of
the MSRV1 retrovirus.

The region corresponding to the N-terminal end
of integrase is represented by its nucleotide sequence
(SEQ ID NO: 112) in Figure 2. The three potential
reading frames are presented by their amino acid
sequence under the nucleotide sequence, and the amino
acid sequence of the N-terminal end of integrase
is identified by SEQ ID NO: 113.

The CL6-3' region is represented by its
nucleotide sequence (SEQ ID NO: 114) in Figure 3. The
three potential reading frames are presented by their
amino acid sequence under the nucleotide
sequence. An amino acid sequence corresponding
to the C-terminal end of the MSRV-1 env protein is
identified by SEQ ID NO: 115.
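
The presentation of three potential reading frames under a nucleotide sequence can be sketched as follows. This is a minimal illustration, not the procedure used to prepare the figures: it assumes the standard genetic code, translates the three forward frames only, and marks stop codons with "*". The function name `three_frames` is hypothetical, and the demonstration input is the primer SEQ ID NO: 69 quoted above, chosen only because its sequence is given in full.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the description does not name a translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; a trailing partial codon is dropped."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        # Unknown codons (e.g. containing N) are rendered as 'X'.
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


def three_frames(dna: str) -> list:
    """Return the translations of the three forward reading frames."""
    seq = dna.replace(" ", "").upper()
    return [translate(seq[offset:]) for offset in range(3)]


frames = three_frames("GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT")
for number, frame in enumerate(frames, start=1):
    print(f"frame {number}: {frame}")
```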

In order to evaluate the promoter activity of
the LTR obtained from clone 6 (cl6), a test of promoter
activity using the enzyme CAT (chloramphenicol acetyl
transferase) was carried out with the corresponding U3R
region. In parallel, a clone containing the same U3R
region of endogenous retroviral RNA expressed in normal
placenta (PH74) and a clone (5M6) obtained from DNA
were tested. The result presented in Figure 12 shows a
very high promoter activity of the LTR derived from MS
plasma (cl6) and a much lower activity
with the sequences of non-MS endogenous origin.

EXAMPLE 2: PREPARATION OF THE C15 CLONE CONTAINING THE
REGION ENCODING A PORTION OF THE MSRV-1 RETROVIRUS
ENVELOPE

An RT-PCR was carried out on the total RNA
extracted from virions concentrated by
ultracentrifugation of a synoviocyte culture supernatant
obtained from an MS patient. The synthesis of cDNA was
carried out with an oligo dT primer and the reverse
transcriptase "Expand™ RT" from Boehringer according to
the conditions recommended by the company. A PCR was
carried out with the Expand™ Long Template PCR System
(Boehringer) under the following conditions: 94°C 5 min
then 93°C 1 min, 60°C 1 min, 68°C 3 min over 40 cycles
and 68°C for 8 min and with a final reaction volume of
50 µl.

Primers used for the PCR: 

- 5' primer, identified by SEQ ID NO: 69
5' GCC ATC AAG CCA CCC AAG AAC TCT TAA CTT 3';

- 3' primer, identified by SEQ ID NO: 116
5' TGG GGT TCC ATT TGT AAG ACC ATC TGT AGC TT 3'

A second so-called "seminested" PCR was carried
out with a 5' primer situated inside the region already
amplified. This second PCR was carried out under the
same experimental conditions as those used for the
first PCR (except that 30 cycles were used instead of
40), using 10 µl of the amplification product derived
from the first PCR.

Primers used for the seminested PCR: 

- 5' primer, identified by SEQ ID NO: 70
5' CCA ATA GCC AGA CCA TTA TAT ACA CTA ATT 3';

- 3' primer, identified by SEQ ID NO: 116

The primers SEQ ID NO: 69 and SEQ ID NO: 70 are
specific for the pol region of MSRV-1. The primer SEQ
ID NO: 116 is specific for the sequence FBd13 (also
called B13) and is located in the env region conserved
among the oncoretroviruses.

An amplification product of 1932 bp was obtained
and cloned in the following manner: 

The amplified DNA was inserted into a plasmid with the
aid of the TA Cloning kit®. The various steps were
carried out in accordance with the instructions for the
TA Cloning kit® (Invitrogen). At the end of the
procedure, the white colonies of recombinant bacteria
were subcultured to allow the extraction of the
incorporated plasmids according to the so-called
"miniprep" procedure. The plasmid preparation of each
recombinant colony was cut with an appropriate
restriction enzyme and analyzed on agarose gel. The
plasmids possessing an insert detected under UV light
after staining the gel with ethidium bromide were
selected for the sequencing of the insert after
hybridization with a primer complementary to the Sp6
promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was
then carried out according to the method recommended
for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied
Biosystems, ref. 402119) and the automated sequencing
was carried out on the Applied Biosystems 373 A and 377
apparatus, according to the manufacturer's
instructions.

The C15 clone obtained contains a region
corresponding to the region of the MSRV-1 envelope of
1481 bp.

The env region of the C15 clone is represented
by its nucleotide sequence (SEQ ID NO: 117) in
Figure 4. The three potential reading frames of this
clone are presented by their amino acid sequence
under the nucleotide sequence. The reading frame
corresponding to an MSRV-1 structural env protein is
identified by SEQ ID NO: 118.

From the defined sequences obtained from clones 
cl6 and C15, it was possible to produce a plasmid
construct encoding a complete envelope followed by the 
3' LTR, as presented in Figure 13 with the 
corresponding reading frame. 

EXAMPLE 3: PREPARATION OF A 5M6 CLONE CONTAINING THE
SEQUENCES OF THE 3' TERMINAL REGION OF THE ENVELOPE,
FOLLOWED BY THE MSRV-1 PROVIRAL TYPE U3, R AND U5
SEQUENCES

A monodirectional PCR was carried out on the
DNA extracted from immortalized B lymphocytes in
culture from an MS patient. The PCR was carried out
with the Expand™ Long Template PCR System (Boehringer)
under the following conditions: 94°C 3 min then 93°C
1 min, 60°C 1 min, 68°C 3 min over 10 cycles, then 93°C
1 min, 60°C 1 min with 15 sec of extension at each
cycle, 68°C 3 min over 35 cycles and 68°C for 7 min and
with a final reaction volume of 50 µl.

The primer used for the PCR, identified by SEQ
ID NO: 119, is 5' TCA AAA TCG AAG AGC TTT AGA CTT GCT
AAC CG 3'.

The primer SEQ ID NO: 119 is specific for the
env region of the C15 clone.

An amplification product of 1673 bp was 
obtained and cloned in the following manner: 

The amplified DNA was inserted into a plasmid with the
aid of the TA Cloning kit®. The various steps were
carried out in accordance with the instructions for the
TA Cloning kit® (Invitrogen). At the end of the
procedure, the white colonies of recombinant bacteria
were subcultured to allow the extraction of the
incorporated plasmids according to the so-called
"miniprep" procedure. The plasmid preparation of each
recombinant colony was cut with an appropriate
restriction enzyme and analyzed on agarose gel. The
plasmids possessing an insert detected under UV light
after staining the gel with ethidium bromide were
selected for the sequencing of the insert after
hybridization with a primer complementary to the T7
promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was
then carried out according to the method recommended
for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied
Biosystems, ref. 402119) and the automated sequencing
was carried out on the Applied Biosystems 373 A and 377
apparatus, according to the manufacturer's
instructions.

The 5M6 clone obtained contains a region
corresponding to the 3' region of the MSRV-1 envelope
of 492 bp followed by the regions U3, R and U5 (837 bp)
of MSRV1.

The 5M6 clone is represented by its nucleotide
sequence (SEQ ID NO: 120) in Figure 5. The three
potential reading frames of this clone are presented by
their amino acid sequence under the nucleotide
sequence. The reading frame corresponding to the
C-terminal end of the MSRV-1 env protein is identified
by SEQ ID NO: 121.

EXAMPLE 4: PREPARATION OF THE LB16 CLONE CONTAINING THE
REGION ENCODING THE MSRV-1 RETROVIRUS INTEGRASE

An RT-PCR was carried out on the total RNA
treated with DNase I and extracted from a choroid plexus
obtained from an MS patient. The synthesis of cDNA was
carried out with an oligo dT primer and the reverse
transcriptase "Expand™ RT" from Boehringer according to
the conditions recommended by the company. A "no RT"
control was carried out in parallel on the same
material. A PCR was carried out with Taq polymerase
(Perkin Elmer) under the following conditions: 95°C
5 min, then 95°C 1 min, 55°C 1 min, 72°C 2 min over 35
cycles and 72°C for 8 min and with a final reaction
volume of 50 µl.

Primers used for the PCR:

- 5' primer, identified by SEQ ID NO: 122
5' GGC ATT GAT AGC ACC CAT CAG 3';

- 3' primer, identified by SEQ ID NO: 123
5' CAT GTC ACC AGG GTG GAA TAG 3'

The primer SEQ ID NO: 122 is specific for the
pol region of MSRV-1 and more precisely similar to the
integrase region described above. The primer SEQ ID NO:
123 was defined on sequences of the clones obtained
during preliminary tests.

An amplification product of about 760 bp was 
obtained only in the test with RT and was cloned in the 
following manner: 

The amplified DNA was inserted into a plasmid with the
aid of the TA Cloning kit®. The various steps were
carried out in accordance with the instructions for the
TA Cloning kit® (Invitrogen). At the end of the
procedure, the white colonies of recombinant bacteria
were subcultured to allow the extraction of the
incorporated plasmids according to the so-called
"miniprep" procedure. The plasmid preparation of each
recombinant colony was cut with an appropriate
restriction enzyme and analyzed on agarose gel. The
plasmids possessing an insert detected under UV light
after staining the gel with ethidium bromide were
selected for the sequencing of the insert after
hybridization with a primer complementary to the T7
promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was
then carried out according to the method recommended
for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied
Biosystems, ref. 402119) and the automated sequencing
was carried out on the Applied Biosystems 373 A and 377
apparatus, according to the manufacturer's
instructions.

The LB16 clone obtained contains the sequences
corresponding to integrase. The nucleotide sequence of
this clone is identified by SEQ ID NO: 124 and is
represented in Figure 11, in which three reading frames
are determined.



EXAMPLE 5: PREPARATION OF A CLONE 2, CL2, CONTAINING IN
3' A PORTION HOMOLOGOUS TO THE POL GENE, CORRESPONDING
TO THE PROTEASE GENE, AND TO THE GAG GENE (GM3)
CORRESPONDING TO THE NUCLEOCAPSID, AND A NEW 5' CODING
REGION, CORRESPONDING TO THE GAG GENE, MORE SPECIFICALLY
THE TEMPLATE AND THE CAPSID OF MSRV-1

A PCR amplification was carried out on the
total RNA extracted from 100 µl of plasma from a
patient suffering from MS. A water control, treated
under the same conditions, was used as negative
control. The synthesis of cDNA was carried out with
300 pmol of a random primer (GIBCO-BRL, France) and the
reverse transcriptase "Expand RT" (BOEHRINGER MANNHEIM,
France) according to the conditions recommended by the
company. An amplification by PCR ("polymerase chain
reaction") was carried out with the enzyme Taq
polymerase (Perkin Elmer, France) using 10 µl of cDNA
under the following conditions: 94°C 2 min, 55°C 1 min,
72°C 2 min then 94°C 1 min, 55°C 1 min, 72°C 2 min over
30 cycles and 72°C for 7 min with a final reaction
volume of 50 µl.

Primers used for the PCR amplification: 

- 5' primer, identified by SEQ ID NO: 126
5' CGG ACA TCC AAA GTG ATG GGA AAC G 3';

- 3' primer, identified by SEQ ID NO: 127
5' GGA CAG GAA AGT AAG ACT GAG AAG GC 3'

A second amplification by so-called 
"seminested" PCR was carried out with a 5' primer 
situated inside the region already amplified. This 
second PCR was carried out under the same experimental 
conditions as those used during the first PCR, using 
10 µl of the amplification product derived from the
first PCR. 

Primers used for the amplification by seminested PCR: 

- 5' primer, identified by SEQ ID NO: 128
5' CCT AGA ACG TAT TCT GGA GAA TTG GG 3';

- 3' primer, identified by SEQ ID NO: 129
5' TGG CTC TCA ATG GTC AAA CAT ACC CG 3'

The primers SEQ ID NO: [lacuna] and SEQ ID NO:
[lacuna] are specific for the pol region, clone G+E+A,
more specifically the E region: nucleotide position No.
423 to No. 448. The primers used in the 5' region were
defined on sequences of clones obtained during
preliminary tests.

An amplification product of 1511 bp was
obtained from the RNA extracted from the plasma of an
MS patient. The corresponding fragment was not observed
for the water control. This amplification product was
cloned in the following manner.

The amplified DNA was inserted into a plasmid
with the aid of the TA Cloning kit™. The 2 µl of DNA
solution were mixed with 5 µl of sterile distilled
water, 1 µl of a 10 times concentrated ligation buffer
"10X LIGATION BUFFER", 2 µl of "pCR™ VECTOR"
(25 ng/ml) and 1 µl of "T4 DNA LIGASE". This mixture was
incubated overnight at 14°C. The following steps were
carried out in accordance with the instructions of the
TA Cloning kit® (Invitrogen). The mixture was plated
after transformation of the ligation into E. coli
INVαF' bacteria. At the end of the procedure, the white
colonies of recombinant bacteria were subcultured to
allow the extraction of the incorporated plasmids
according to the so-called "DNA
minipreparation" procedure (17). The plasmid
preparation of each recombinant colony was cut with the
restriction enzyme EcoRI and analyzed on agarose gel.
The plasmids possessing an insert detected under UV
light after staining the gel with ethidium bromide were
selected for the sequencing of the insert after
hybridization with a primer complementary to the T7
promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was
then carried out according to the method recommended
for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied
Biosystems, ref. 402119) and the automated sequencing
was carried out on the Applied Biosystems 373 A and 377
apparatus, according to the manufacturer's
instructions.

The clone obtained, called CL2, contains a
C-terminal region similar to the 5' terminal region of
the clones G+E+A of MSRV-1, which makes it possible to
define the C-terminal region of the gag gene and a new
region corresponding to the N-terminal region of the
MSRV-1 gag gene.

CL2 makes it possible to define a region of
1511 bp having an open reading frame in the N-terminal
region of 1077 bp encoding 359 amino acids and a
non-open reading frame of 454 bp corresponding to the
C-terminal region of the MSRV-1 gag gene.

The nucleotide sequence of CL2 is identified by 
SEQ ID NO: 130. It is represented in Figure 6 with the 
potential reading frames in amino acids.

The 1077 bp fragment of CL2 encoding 359 amino
acids was amplified by PCR with the Pwo enzyme (5 U/µl)
(Boehringer Mannheim, France) using 1 µl of the DNA
minipreparation of clone 2 under the following
conditions: 95°C 1 min, 60°C 1 min, 72°C 2 min over
25 cycles and with a final reaction volume of 50 µl
with the aid of the primers:

- 5' primer (BamHI), identified by SEQ ID NO: 132
5' TGC TGG AAT TCG GGA TCC TAG AAC GTA TTC 3' (30 mer),
and

- 3' primer (HindIII), identified by SEQ ID NO: 133
5' AGT TCT GCT CCG AAG CTT AGG CAG ACT TTT 3' (30 mer)

corresponding, respectively, to the nucleotide sequence
of clone 2 at position -9 to 21 and 1066 to 1095.
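
The two subcloning primers are stated to be 30-mers carrying a BamHI and a HindIII site, respectively. The minimal sketch below checks both properties on the sequences as printed. The recognition sequences GGATCC (BamHI) and AAGCTT (HindIII) are the standard ones for these enzymes; the names `SITES`, `PRIMERS` and `describe` are hypothetical.

```python
# Standard recognition sequences of the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}

# Primer sequences as printed in the description, with their stated enzyme.
PRIMERS = {
    "SEQ ID NO: 132": ("TGC TGG AAT TCG GGA TCC TAG AAC GTA TTC", "BamHI"),
    "SEQ ID NO: 133": ("AGT TCT GCT CCG AAG CTT AGG CAG ACT TTT", "HindIII"),
}


def describe(primer: str, enzyme: str) -> tuple:
    """Return (length, 0-based offset of the site, or -1 if the site is absent)."""
    seq = primer.replace(" ", "").upper()
    return len(seq), seq.find(SITES[enzyme])


for name, (primer, enzyme) in PRIMERS.items():
    length, offset = describe(primer, enzyme)
    print(f"{name}: {length} mer, {enzyme} site at offset {offset}")
```

Placing the site a dozen bases in from the 5' end leaves flanking nucleotides for the enzyme to bind, which is a common design choice for cloning primers.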

The fragment obtained by PCR was linearized
with BamHI and HindIII and subcloned into the
expression vectors pET28C and pET21C (NOVAGEN)
linearized with BamHI and HindIII. The sequencing of
the DNA of the 1077 bp fragment of clone 2 in the two
expression vectors was carried out according to the
method recommended for the use of the sequencing kit
"PRISM™ Ready Reaction AmpliTaq® FS, DyeDeoxy™
Terminator" (Applied Biosystems, ref. 402119) and the
automated sequencing was carried out on the Applied
Biosystems 373 A and 377 apparatus, according to the
manufacturer's instructions.

The expression of the nucleotide sequence of
the 1077 bp fragment of clone 2 by the expression
vectors pET28C and pET21C is identified by SEQ ID NO:
135 and SEQ ID NO: 137, respectively.

EXAMPLE 6: EXPRESSION OF CLONE 2 IN ESCHERICHIA COLI

The constructs pET28C-clone 2 (1077 bp) and
pET21C-clone 2 (1077 bp) synthesize, in the bacterial
strain BL21 (DE3), a protein fused at the N- and
C-terminus for the vector pET28C and the C-terminus for
the vector pET21C with 6 Histidines, having an apparent
molecular mass of about 45 kDa, identified by SDS-PAGE
polyacrylamide gel electrophoresis (SDS = Sodium
Dodecyl Sulfate) (Laemmli, 1970 (1)). The reactivity of
the protein was demonstrated towards an anti-Histidine
monoclonal antibody (DIANOVA) by the Western-blot
technique (Towbin et al., 1979 (2)).
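
As a rough plausibility check of the figures above, the sketch below converts the 1077 bp open reading frame into a residue count and a back-of-the-envelope mass. It assumes an average residue mass of about 110 Da, a common rule of thumb that does not come from the description. The estimate covers the 359 encoded residues only; histidine tags and any vector-derived residues add to it, and SDS-PAGE migration is itself approximate, so the result is not expected to equal the apparent mass of about 45 kDa. The names `codon_count` and `approx_mass_kda` are hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue (an assumption,
# not a value taken from the description).
AVERAGE_RESIDUE_MASS_DA = 110.0


def codon_count(base_pairs: int) -> int:
    """Number of codons in an open reading frame of the given length."""
    if base_pairs % 3 != 0:
        raise ValueError("an open reading frame must be a multiple of 3 bp")
    return base_pairs // 3


def approx_mass_kda(residues: int) -> float:
    """Approximate polypeptide mass in kDa from the residue count."""
    return residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


residues = codon_count(1077)
print(residues, "residues, roughly", round(approx_mass_kda(residues), 1), "kDa before tags")
```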

The recombinant proteins pET28C-clone 2
(1077 bp) and pET21C-clone 2 (1077 bp) were visualized
by SDS-PAGE in the insoluble fraction after enzymatic
digestion of the bacterial extracts with 50 µl of
lysozyme (10 mg/ml) and ultrasound lysis.

The antigenic properties of the recombinant
antigens pET28C-clone 2 (1077 bp) and pET21C-clone 2
(1077 bp) were tested by Western blotting after
solubilization of the bacterial pellet with 2% SDS and
50 mM β-mercaptoethanol. After incubation with sera
from patients suffering from multiple sclerosis, the
sera from neurological controls and the sera from
controls at the Blood Transfusion Center (CTS), the
immune complexes were detected with the aid of an
alkaline phosphatase-coupled goat serum anti-human IgG
and anti-human IgM.

The results are presented in the table below. 

TABLE

Reactivity of sera affected by multiple sclerosis and
controls with the MSRV-1 recombinant protein gag
clone 2 (1077 bp) = pET21C-clone 2 (1077 bp) and
pET28C-clone 2 (1077 bp) (a)

DISEASE                  NUMBER OF            NUMBER OF POSITIVE
                         INDIVIDUALS TESTED   INDIVIDUALS

MS                       15                   6: 2(+++), 2(++), 2(+)
NEUROLOGICAL CONTROLS    2                    1(+++)
HEALTHY CONTROLS (CTS)   22                   1(+/-)

(a) The strips containing 1.5 µg of recombinant
antigen pET-gag clone 2 (1077 bp) exhibit reactivity
against sera diluted 1/100. The Western-Blot
interpretation is based on the presence or absence of a
specific pET-gag clone 2 (1077 bp) band on the strips.
Positive and negative controls are included in each
experiment.

These results show that, under the technical
conditions used, about 40% of the sera tested from
patients affected by multiple sclerosis react with the
recombinant proteins pET28C-clone 2 (1077 bp) and
pET21C-clone 2 (1077 bp). Reactivity was observed on a
neurological control and it is of interest to note that
the RNAs extracted from this serum, after the reverse
transcriptase step, are also amplified by PCR in the
pol region. This suggests that people who have not
declared MS may also harbor and express this virus. On
the other hand, an apparently healthy control (CTS
donor) possesses anti-gag (clone 2, 1077 bp)
antibodies. This is compatible with an immunity
acquired against MSRV-1 independently of a declared
associated autoimmune disease.
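
The "about 40%" figure follows directly from the table (6 positive sera out of 15 MS sera). The minimal sketch below reproduces that arithmetic for each group. The single positive serum assumed for each control group is a reading of the table that the text supports ("a neurological control", "an apparently healthy control"); it is labeled as an assumption in the code. With groups this small the percentages are descriptive only.

```python
# (positive, tested) per group. The MS row is stated in the table;
# the value 1 for each control group is an assumption drawn from the
# wording of the accompanying text.
COUNTS = {
    "MS": (6, 15),
    "Neurological controls": (1, 2),
    "Healthy controls (CTS)": (1, 22),
}


def positive_rate(positive: int, tested: int) -> float:
    """Percentage of positive sera, rounded to one decimal place."""
    if tested <= 0:
        raise ValueError("tested must be positive")
    return round(100.0 * positive / tested, 1)


for group, (positive, tested) in COUNTS.items():
    print(f"{group}: {positive}/{tested} = {positive_rate(positive, tested)}%")
```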

EXAMPLE 7: PREPARATION OF AN LB13 CLONE CONTAINING IN
3' A PORTION HOMOLOGOUS TO CLONE 2 CORRESPONDING TO THE
GAG GENE AND IN 5' A PORTION HOMOLOGOUS TO THE 5M6
CLONE CORRESPONDING TO THE U5 LTR REGION

An RT-PCR ("reverse transcriptase-polymerase
chain reaction") was carried out using total RNA
extracted from virions, obtained from supernatants of B
lymphocyte cells of patients suffering from multiple
sclerosis, concentrated by ultracentrifugation. The
synthesis of cDNA was carried out with a specific
primer SEQ No. XXX and the reverse transcriptase
"Expand™ RT" from BOEHRINGER MANNHEIM according to the
conditions recommended by the company.

Primer used for the synthesis of the cDNA, identified
by SEQ ID NO: 138:

5' CTT GGA GGG TGC ATA ACC AGG GAA T 3'

A PCR amplification was carried out with Taq
polymerase (Perkin Elmer, France) under the following
conditions: 94°C 1 min, 55°C 1 min, 72°C 2 min over 35
cycles and 72°C for 7 min and with a final reaction
volume of 100 µl.

Primers used for the PCR amplification: 

- 5' primer, identified by SEQ ID NO: 139
5' TGT CCG CTG TGC TCC TGA TC 3'

- 3' primer, identified by SEQ ID NO: 138
5' CTT GGA GGG TGC ATA ACC AGG GAA T 3'

A second so-called "seminested" PCR
amplification was carried out with a 3' primer situated
inside the region already amplified. This second
amplification was carried out under the same
experimental conditions as those used during the first
amplification, using 10 µl of the amplification product
derived from the first PCR.

Primers used for the "seminested" PCR amplification:

- 5' primer, identified by SEQ ID NO: 139
5' TGT CCG CTG TGC TCC TGA TC 3'

- 3' primer, identified by SEQ ID NO: 140
5' CTA TGT CCT TTT GGA CTG TTT GGG T 3'

The primers SEQ ID NO: 138 and SEQ ID NO: 140
are specific for the gag region, clone 2 nucleotide
position No. 373-397 and No. 433-456. The primers used
in the 5' region were defined on sequences of the
clones obtained during preliminary tests.

An amplification product of 764 bp was obtained 
and cloned in the following manner: 

The amplified DNA was inserted into a plasmid
with the aid of the TA Cloning kit™. The 2 µl of DNA
solution were mixed with 5 µl of sterile distilled
water, 1 µl of a 10 times concentrated ligation buffer
"10X LIGATION BUFFER", 2 µl of "pCR™ VECTOR"
(25 ng/ml) and 1 µl of "T4 DNA LIGASE". This mixture was
incubated overnight at 14°C. The following steps were
carried out in accordance with the instructions of the
TA Cloning kit® (Invitrogen). The mixture was plated
after transformation of the ligation into E. coli
INVαF' bacteria. At the end of the procedure, the white
colonies of recombinant bacteria were subcultured to
allow the extraction of the incorporated plasmids
according to the so-called "DNA
minipreparation" procedure (17). The plasmid
preparation of each recombinant colony was cut with the
restriction enzyme EcoRI and analyzed on agarose gel.
The plasmids possessing an insert detected under UV
light after staining the gel with ethidium bromide were
selected for the sequencing of the insert after
hybridization with a primer complementary to the T7
promoter present on the cloning plasmid of the TA
Cloning kit®. The reaction prior to the sequencing was
then carried out according to the method recommended
for using the sequencing kit "PRISM™ Ready Reaction
AmpliTaq® FS, DyeDeoxy™ Terminator" (Applied
Biosystems, ref. 402119) and the automated sequencing
was carried out on the Applied Biosystems 373 A and 377
apparatus, according to the manufacturer's
instructions.

The LB13 clone obtained contains an N-terminal
region of MSRV-1 gag gene homologous to clone 2 and an
LTR corresponding to a portion of the U5 region.
Between the U5 region and gag, a binding site for the
transfer RNAs, the PBS "primer binding site", was
identified.

The nucleotide sequence of the 764 bp fragment
of the LB13 clone in the plasmid "pCR™ vector" is
represented in the identifier SEQ ID NO: 141.

The binding site for the transfer RNAs, having
a sequence of PBS tryptophan type, was identified at
nucleotide position No. 342-359 of the LB13 clone.

As this same PBS was found in the endogenous
copies homologous to MSRV1, the endogenous family thus
defined is henceforth called HERV-W, according to the
nomenclature proposed for the endogenous retrovirus
families (W = tryptophan).

A short ORF of about 65 amino acids was found
in the U5 region of the 5' LTR of the LB13 clone.

Sequence of the ORF:
PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWEL
WEQGPQVTIW.

The corresponding nucleotide sequence starting 
at an ATG codon is capable of being expressed in a 
subgenomic DNA from a proviral LTR (U3RU5) . 
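As a quick consistency check on the ORF quoted above, the peptide can be measured directly. This is only a sketch: the variable names are illustrative, and the peptide string is copied from the text as printed.

```python
# ORF peptide as printed in the text (one-letter amino acid code).
ORF_PEPTIDE = (
    "PMASNRAITLTAWSKIPFLGIRETKNPRSENTRLATMLEAAHHHFGSSPPLSWEL"
    "WEQGPQVTIW"
)

total_residues = len(ORF_PEPTIDE)

# The text notes that the coding sequence starts at an ATG codon,
# i.e. at the first methionine (M) of the peptide.
first_met_index = ORF_PEPTIDE.index("M")
from_first_met = ORF_PEPTIDE[first_met_index:]

print(total_residues, len(from_first_met))
```

The printed string contains 65 residues, or 64 when counted from the first methionine, which agrees with the "about 65 amino acids" stated in the text.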

Another clone, called LA15, was obtained from the
total RNA extracted from virions concentrated by
ultracentrifugation from a culture supernatant of
synoviocytes obtained from a patient suffering from
rheumatoid arthritis. The strategy for amplifying and
cloning the LA15 clone is exactly the same as that
used for the LB13 clone.

The nucleotide sequence of the LA15 clone,
which is represented in the identifier SEQ ID NO: 142,
is very similar to that of the LB13 clone. This suggests
that the MSRV-1 retrovirus detected in multiple sclerosis
has sequences which are similar to those found in
rheumatoid arthritis.
EXAMPLE 8: RECONSTRUCTION OF AN RU5-GAG REGION FROM THE
CLONES LB15, LB13, CL2 AND CL17

The clones CL2 and LB13 have already been
described in the preceding examples. The LB15 clone was
obtained using the R sequence of the LTR of the cl6
clone in order to define a primer in 5', and the
antisense primers used are the same as for the LB13 clone.
The CL17 clone was obtained by nested RT-PCR using the
following primers:

5'-TCATGCAACTGCACTCTTCTGGTCCG-3' (sense)
5'-TCTTGCACTAACCTCCACTGTCCGTTGG-3' (antisense)
5'-ATCCCCCAGTAACAATTTGGTGACCACG-3' (sense)
5'-TCGGGTCTAAGAGGGTACTTCCTTTGGTAGG-3' (antisense)
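Primer lists of this kind are commonly sanity-checked for length, GC content and reverse complement before use. The sketch below does this for the four CL17 primers as listed; the helper names are illustrative assumptions, and the text does not say which primers form the outer and inner pairs of the nested reaction, so none is assumed here.

```python
# The four CL17 primers as listed in the text, written 5' to 3'.
CL17_PRIMERS = [
    ("sense", "TCATGCAACTGCACTCTTCTGGTCCG"),
    ("antisense", "TCTTGCACTAACCTCCACTGTCCGTTGG"),
    ("sense", "ATCCCCCAGTAACAATTTGGTGACCACG"),
    ("antisense", "TCGGGTCTAAGAGGGTACTTCCTTTGGTAGG"),
]

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


primer_lengths = [len(seq) for _, seq in CL17_PRIMERS]

for orientation, seq in CL17_PRIMERS:
    print(orientation, len(seq), round(gc_fraction(seq), 2), reverse_complement(seq))
```

The reverse complement of each antisense primer gives the sequence as it would read on the sense strand, which is the form to search for in a reconstructed RU5-gag sequence.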

The LB15 clone was obtained from virions
obtained by culturing MS cells. The CL17 clone was
obtained from culturing plasma from an MS patient.
These overlapping clones made it possible to
reconstruct an RU5-gag sequence with a potential ORF in
the gag gene, as presented in Figure 14.

EXAMPLE 9: PREPARATION OF A CLONE 87-23

The region corresponding to integrase was
amplified and cloned from MS plasma using a seminested
RT-PCR with the following primers situated in the pol
and env regions of MSRV-1.

In the pol region:

5'-TTACGCAGGTCTCAGGGATGAGCTT-3' (sense, primary PCR)
5'-CGGCAGTAGCAGTCTTAGTATCTGAAGCAGTTA-3' (sense, secondary PCR)

In the env region:

5'-GGTACGGAGGGTTTCATGTAGTTTTGAG-3' (antisense, primary
and secondary PCR)

The amplified clone contains 774 bp in the
pol/RT region, all the integrase region (1197 bp) and
the start of the env region (480 bp). The nucleotide
sequence corresponding to the integrase region and the
translation to amino acids of the potential ORF are
presented in Figure 15.
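The segment sizes given for clone 87-23 can be combined into a simple length check. This is a sketch only: the dictionary keys are illustrative labels, and the codon count is an upper bound that assumes the whole integrase region is read in a single frame.

```python
# Segment lengths of clone 87-23 as stated in the text, in base pairs.
SEGMENTS_BP = {
    "pol/RT": 774,
    "integrase": 1197,
    "env (start)": 480,
}

total_bp = sum(SEGMENTS_BP.values())

# Number of whole codons in the integrase region and any leftover bases.
integrase_codons, leftover_bases = divmod(SEGMENTS_BP["integrase"], 3)

print(total_bp, integrase_codons, leftover_bases)
```

The three segments add up to 2451 bp, and the 1197 bp integrase region divides evenly into 399 codons.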



EXAMPLE 10: CONFIRMATION OF THE PRESENCE OF RNA
CONTAINING ENV SEQUENCES RELATED TO ERV9 IN THE
RETROVIRAL PARTICLES ASSOCIATED WITH THE MSRV-1 GENOME

Sequences related to ERV9 have been found in a
minor proportion in the virion preparations obtained
from MS compared with the MSRV-1 sequences. The
existence of phenomena of co-encapsidation of
phylogenetically related endogenous sequences into
retroviral particles produced by a replicative strain
has been described. Surprisingly, an RNA region
comprising an ORF starting in the 3' portion of env and
continuing potentially into the 3' LTR has been found
in various MS samples. In order to confirm the
existence of an ORF, transcription-translation tests
were carried out and made it possible to show the
reality of an env ORF containing the entire
transmembrane (TM) portion and ending at the start of
the putative LTR. However, an additional frame (ORFX)
follows and continues in the 3' LTR. The two products
of expression were visualized and their respective ORFs
were subcloned. Figure 16 represents the nucleotide and
peptide sequences of the B13 clone already described,
specifying the ORFs in the truncated env region and in
the putative LTR. The presence of such RNAs may be
responsible for recombinations with the replicative
strain and consequently generate strains having a
modified pathogenicity.
CLAIMS

1. Nucleic material, in isolated or purified
state, comprising a nucleotide sequence chosen from the
group which consists of (i) the sequences
SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117,
SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130,
SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences
complementary to sequences (i); and (iii) the sequences
equivalent to sequences (i) or (ii), in particular the
sequences having, for every series of 100 contiguous
monomers, at least 50%, and preferentially at least 70%
homology with sequences (i) or (ii) respectively.

2. Nucleic material, in isolated or purified
state, encoding a polypeptide having, for every
contiguous series of at least 30 amino acids, at least
50%, and preferably at least 70% homology with a
peptide sequence chosen from the group which consists
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118,
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137.

3. Retroviral nucleic material, whose pol gene
comprises a nucleotide sequence identical or equivalent
to a sequence chosen from the group which consists of
SEQ ID NO: 112, SEQ ID NO: 124 and their complementary
sequences.

4. Retroviral nucleic material, in which the 5'
end of the pol gene starts at nucleotide 1419 of SEQ ID
NO: 130.

5. Retroviral nucleic material, in which the pol
gene encodes a polypeptide having, for every contiguous
series of at least 30 amino acids, at least 50%, and
preferably at least 70% homology with the peptide
sequence SEQ ID NO: 113.

6. Retroviral nucleic material, in which the 3'
end of the gag gene ends at nucleotide 1418 of SEQ ID
NO: 130.

7. Retroviral nucleic material, in which the env
gene comprises a nucleotide sequence identical or
equivalent to a sequence chosen from the group which
consists of SEQ ID NO: 117, and its complementary
sequences.

8. Retroviral nucleic material, in which the env
gene comprises a nucleotide sequence which starts at
nucleotide 1 of SEQ ID NO: 117 and ends at nucleotide
at nucleotide [sic] 233 of SEQ ID NO: 114.

9. Retroviral nucleic material, in which the env
gene encodes a polypeptide having, for every contiguous
series of at least 30 amino acids, at least 50%, and
preferably at least 70% homology with the sequence
SEQ ID NO: 118.

10. Retroviral nucleic material in which the U3R
region of the 3' LTR comprises a nucleotide sequence
which ends at nucleotide 617 of SEQ ID NO: 114.

11. Retroviral nucleic material in which the RU5
region of the 5' LTR comprises a nucleotide sequence
which starts at nucleotide 755 of SEQ ID NO: 120 and
ends at nucleotide 337 of SEQ ID NO: 141 or SEQ ID NO:
142.

12. Retroviral nucleic material comprising a 
sequence which starts at nucleotide 755 of SEQ ID NO: 
120 and which ends at nucleotide 617 of SEQ ID NO: 114. 

13. Retroviral nucleic material according to any
one of the preceding claims, characterized in that it
is associated with at least one autoimmune disease such
as multiple sclerosis or rheumatoid arthritis.

14. Nucleotide fragment comprising a nucleotide
sequence chosen from the group which consists of (i)
the sequences SEQ ID NO: 112, SEQ ID NO: 114,
SEQ ID NO: 117, SEQ ID NO: 120, SEQ ID NO: 124,
SEQ ID NO: 130, SEQ ID NO: 141 and SEQ ID NO: 142; (ii)
the sequences complementary to sequences (i); and (iii)
the sequences equivalent to sequences (i) or (ii), in
particular the sequences having, for every series of
100 contiguous monomers, at least 50%, and
preferentially at least 70% homology with sequences (i)
or (ii) respectively.

15. Nucleotide fragment according to claim 14,
consisting of a nucleotide sequence chosen from the
group which consists of (i) the sequences
SEQ ID NO: 112, SEQ ID NO: 114, SEQ ID NO: 117,
SEQ ID NO: 120, SEQ ID NO: 124, SEQ ID NO: 130,
SEQ ID NO: 141 and SEQ ID NO: 142; (ii) the sequences
complementary to sequences (i); and (iii) the sequences
equivalent to sequences (i) or (ii), in particular the
sequences having, for every series of 100 contiguous
monomers, at least 50%, and preferentially at least 70%
homology with sequences (i) or (ii) respectively.

16. Nucleotide fragment comprising a nucleotide
sequence encoding a polypeptide having, for every
contiguous series of at least 30 amino acids, at least
50%, and preferably at least 70% homology with a
peptide sequence chosen from the group which consists
of SEQ ID NO: 113, SEQ ID NO: 115, SEQ ID NO: 118,
SEQ ID NO: 121, SEQ ID NO: 135 and SEQ ID NO: 137.

17. Nucleotide fragment according to claim 16,
consisting of a nucleotide sequence encoding a
polypeptide having, for every contiguous series of at
least 30 amino acids, at least 50%, and preferably at
least 70% homology with a peptide sequence chosen from
the group which consists of SEQ ID NO: 113, SEQ ID NO:
115, SEQ ID NO: 118, SEQ ID NO: 121, SEQ ID NO: 135 and
SEQ ID NO: 137.

18. Nucleic probe for the detection of a retrovirus
associated with multiple sclerosis and/or rheumatoid
arthritis, characterized in that it is capable of
hybridizing specifically with any fragment according to
any one of claims 14 to 17, belonging to the genome of
said retrovirus.

19. Probe according to claim 18, characterized in
that it possesses from 10 to 100 nucleotides,
preferably from 10 to 30 nucleotides.

20. Primer for the amplification, by
polymerization, of an RNA or of a DNA of a retrovirus
associated with multiple sclerosis and/or rheumatoid
arthritis, characterized in that it comprises a
nucleotide sequence identical or equivalent to at least
a portion of the nucleotide sequence of a fragment
according to any one of claims 8 to 11, in particular a
nucleotide sequence having, for every series of 10
contiguous monomers, at least 50%, preferably at least
70% homology with at least said portion of said
fragment.

21. Primer according to claim 20, characterized in
that its nucleotide sequence is chosen from
SEQ ID NO: 116, SEQ ID NO: 119, SEQ ID NO: 122,
SEQ ID NO: 123, SEQ ID NO: 126, SEQ ID NO: 127,
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 132, and
SEQ ID NO: 133.

22. RNA or DNA, and in particular replication
and/or expression vector, comprising a genomic fragment
of the nucleic material according to any one of claims
1 to 7 or a fragment according to any one of claims 14
to 17.

23. Peptide encoded by any open reading frame
belonging to a nucleotide fragment according to any one
of claims 14 to 17, in particular a polypeptide, for
example oligopeptide forming or comprising an antigenic
determinant recognized by sera of patients infected
with the MSRV-1 virus, and/or in whom the MSRV-1 virus
has been reactivated.

24. Peptide according to claim 23 comprising a
sequence identical, partially or completely, or
equivalent to a sequence chosen from SEQ ID NO: 113,
SEQ ID NO: 115, SEQ ID NO: 118, SEQ ID NO: 121,
SEQ ID NO: 135 and SEQ ID NO: 137.

25. Diagnostic, prophylactic or therapeutic
composition, in particular for inhibiting the
expression of at least one retrovirus associated with
multiple sclerosis and/or rheumatoid arthritis,
comprising a nucleotide fragment according to any one
of claims 14 to 17.

26. Method for detecting a retrovirus associated
with multiple sclerosis and/or rheumatoid arthritis, in
a biological sample, characterized in that an RNA
and/or a DNA assumed to belong to or obtained from said
retrovirus, or their complementary RNA and/or DNA, is
brought into contact with a composition comprising a
nucleotide fragment according to any one of claims 14
to 17.
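Several of the claims above express equivalence as a minimum percentage of homology "for every series of 100 contiguous monomers". The sketch below shows one way such a windowed criterion could be evaluated. It rests on loud assumptions that are not in the source: the two sequences are already aligned without gaps and have equal length, and simple position-by-position identity is used as a stand-in for "homology". The function name and the toy sequences are hypothetical.

```python
def min_window_identity(a: str, b: str, window: int = 100) -> float:
    """Lowest fraction of identical positions over every contiguous window.

    Assumes `a` and `b` are pre-aligned, ungapped and of equal length.
    """
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    if window <= 0 or len(a) < window:
        raise ValueError("window must be positive and no longer than the sequences")

    matches = [x == y for x, y in zip(a, b)]
    current = sum(matches[:window])  # matches in the first window
    lowest = current
    # Slide the window one position at a time, updating the count incrementally.
    for i in range(window, len(matches)):
        current += matches[i] - matches[i - window]
        lowest = min(lowest, current)
    return lowest / window


# Toy data only: a 200-mer compared with a copy whose first 60 bases differ.
reference = "ACGT" * 50
variant = "N" * 60 + reference[60:]

worst = min_window_identity(reference, variant)
print(worst, worst >= 0.50)
```

Taking the minimum over all windows mirrors the word "every" in the claim language: a single window below the threshold is enough for the criterion to fail.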
FIG 2



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890 

GCTIATAGAA G3ACG0CTAG TAT0333I2iA TO3ZCICI03 GAAAQCAAGC 50 
AYRR TPS MG. SP'LG NQA 
LIE GPLV WGN PLW ETKP 
L . K DP. YGVI PSGKPS 

OCTAGTACTC AGCAQGAAAA ATAGAAIAGG AAAGCICACA AGGACATACr 100 
PVL SRKN RIG NLT RTYF 
QYS AGK IE.E TSQ GHT 
PSTQ QEK .NR KPHK DIL 

TIQZTGOOCT CXHAGMQ3CT AGQCACTGAG GAAGGAAAAA TACTITCAOC 150

PPL QMA SH.G RKN TFT 
FLPS RWL ATE EGKI LSP 
SSP PDG. PLR KEK YFHL 

TGCAQCTAAC CAACAGAAAT TACITAAAAC OCTICAOZAA AOCTIOCACr 200 
CS.P TEI T.N PSPN LPL 
AAN QQKL LKT LHQ TFHL 
QLT NRN YLKP FTK PST 

TAG3CATIGA TAGCACOCAT CAGATG30CA AATTATTATT TACIGGACTA 250 
R H . .HPS DGQ III YWTR 

GID STH QMAK LLF TGP 
• A L I API RWP NYYL LDQ 

Q3XT1T1LA AAACTATCAA GAAGATAGTC AGQGGCIGTG AAGIGTGGCA 300 

PFQ NYQ EDSQ GL. SVP 
GLFK TIK KIV RGCE VCQ 
AFS KLSR R.S GAV KCAK 

AAGAAATAAT 310 
K K . 
R N N 
E I 
FIG 2 (continued)



10 20 30 40 50 

1234567890 1234567890 1234567890 1234567890 1234567890

CCUimmLT TmAOCTCCT IGTTAAGrETT Ull'llTllCA GAATCAAAAC 50 
PCIF N L L VKF VSSR IKT 
PVS LTSL LSL SLP ESKL 
LYL .PP C.VC LFQ NQN 

TCTSAAACm CAAATTGTIC TICAAA1G3A QCAGCAGATC GAGIQCATGA 100 
VKL QIVL QME HQM ESMT 
.NY KLF FKWS TRW SP. 
CKTT NCS SNG APDG VHD 

CTAAGATOCA OCGTGGAOGC CT33AD033C CT3CTAGCCC ATGCICOGAT 150 
KIH RGP L D R P ASP CSD 

LRST VDP WTG L LAH APM 
.DP PWTP GPA C.P MLRC 

GTTAATCACA TIGAAGGCAC G0CICGO3AG GAAATCICAA CIGCACAACC 200 
VNDI EGT PPE EIST AQP 
LMT LKAP LPR KSQ LHNP 
. .H . R H PSRG NLN CTT 

OCTACTATGC COCAATICAG QQ33AAQCAG TTAGAGDSGT CATCAQCCAA 250 
LLC PNSA GSS .SG HQPT 
YYA PIQ REAV RAV ISQ 
PTMP QFS GKQ LERS SAN 

CCIDCCCAAC AGCACTIGG3 TITlUL'lUl ' l ' GAGAQ33333 ACIGAGAGAC 300 

SPT ALG FSC. EGG LRD 
PPQQ HLG FPV ERGD .ET 
LPN STWV FLL RGG TERQ 

AGGACTAGCT GGA Tl ' lUL ' JA GGCCAACGAA GAA1CCCTAA GQCTAGCIGG 350 
RTSW IS. ANE ESLS LAG 
GLA GFPR PTK NP. A.LG 
D.L DFL GQRR IPK PSW 
FIG 3
1234567890 1234567890 1234567890 1234567890 1234567890
GAAG3TGACT QCAIDZAOCT CIAAACATO3 GGCTIGCAAC TTAGZTCACA 400 
KVT ASTS KHG ACN LAHT 
R.L HPP LNMG LAT .LT 
EGDC I H L . T W GLQL S S H 

OGOGACCAAT CAGAGAQCIC ACTOAAAIGC TAATEAG3CA AAAATAG3AG 450 

RPI REL TKML IRQ K.E 
PDQS ESS LKC L G K NRR 

PTN QRAH . N A N.A KIGG 

GIAAAGAAAT AGOCAATCAT C3ATIGQCIG AGAGCACAGC GGGAG33ACA 500 
VKK. PII YCL RAQR EGQ 
. R N SQSS IA. EHS GRDK 
KEI ANH LLPE STA GGT 



AQSATO3QGA T3EAAA00CA GC3CATIOGAG 0033CAA033 CAACCQOCTr 550 
GSG YKPR HSS RQR QPPL 
DRD INP GIRA GNG NPL 
RIGI . T Q AFE PATA TPF 

iGGGitmrr cxx-'muiar gqgogcicig titicacict ATITCACICT 600 

GPL PLY GRSV FTL FHS 
WVPS LCM GAL FSLY FTL 
GSP PFVW ALC FHS ISLY 

ATEAAATCIT GCAACIGAAA AAAAAAAAAA AAAAA 635 
IKSC N.K KKK K 
L N L ATEK KKK K 
• IL QLK KKKK K 
FIG 4
AIQGOaCTGC CTIATCATAC 'iTl'lUiUl'l'l' AL'lUriL'lLT TMJJOCCTTT 50
MALP YHT FLF T V L L PPF 
WPS LIIL FSL LFS YPLS 
GPP LSY FSLY CSL TPF 

03CTCICACT GCAOCOGCIC CAIGCiUC'lG TACAAOCAGT AGCIOOOCTT 100 
ALT APPP CCC TTS SSPY 
LSL HPL HAAV QPV APL 
RSHC TPS MLL YNQ. LPL 

ACCAAGAGTT TCTATGAAGA ACX3039CITC CTGGAAATAT TGAIGQO0CA 150 

QEF L.R TRLP GNI DAP 
TKSF YEE RGF LEIL MPH 
PRV SMKN A A S WKY .CPI 

TCATATAG3A GITTATCTAA GGGAAACTCC ACCITCACIG OOCSOCCXZA 200 
SYRS LSK GNS TFTA HTH 
HIG VYLR ETP PSL PTPI 
I.E F I . G K L H LHC PHP 

TATGO0QCX3C AACIQCTATA ACTCIGCCAC TCTTTGCATG CATGCAAATA 250 
MPR NCYN SAT LCM HANT 
CPA TAI TLPL FAC MQI 
YAPQ L L . LCH SLHA CKY 

CTCA l'Jm'lG GACAG33AAA ATGATTAATC CTAGITGTCC TGGAG3ACTT 300 

HYW TGK MINP SCP G G L 
LIIG QGK .LI LVVL EDL 
SLL DREN D.S .LS WRTW 

GGAGGCACTG TCIGTTGGAC TI3VCTTCAGC CATACCAGTA TGTCIGATG3 350 
GATV CWT YFT HTSM SDG 
EPL SVGL TSP IPV CLMG 
SHC LLD LLHP YQY V.W 
ACUICAOCIG TGTAAAATIT AGCAATACIA TJO^CACAAC CAGCIUDCAA 750

LTC VKF SNTI DTT SSQ 
TSPV N L AIL . T Q P APN 

PHL C K I . QYY RHN QLPM 

TGCATCAQ3T G3GTEAACAQT TOa3£A03A ATAG'ICIQOC TAOOCICAQG 800 
CIRW VTP PTR IVCL PSG 
ASG G.HL PHE .SA YPQE 
HQV GNT SHTN SLP TLR 

AATRliTlTlT GTCIGTGGTA CXJTCAQOCTA TCAl'lUlTiG AATGGCICIT 850 
IFF VCGT SAY HCL NGSS 
YFL S V V PQPI IV. MAL 
NIFC LWY LSL S'L F E W L F 



CAGAATCIAT GiUCTlUL'lU TCAT1LT1AG 1QOCXXCEAT GACOTCIAC 900 

ESM CFL SFLV PPM TIY 
QNLC ASS HS. CPL. PST 
RIY VLPL I L S APY DHLH 

ACIGAA2AAG ATTEATACAA TCA1U1UU1A OCmAGGCOC ACAACAAAAG 950 
TEQD LYN HVV PKPH NKR 
LNK IYTI MSY LSP TTKE 
. T R FIQ SCRT .AP QQK 

AGTAOCCATr L.TIULTITIU TERTCAGAGC AQSAGIGCTA GGGAGACTAG 1000 
VPI LPFV IRA GVL GRLG 
YPF FLL LSEQ.EC. AD. 
STHS SFC YQS RSAR QTR 

GTACIG3CAT TGGCftGTATC ACAACCICEA CICAGl'lL'IA CIACAAACEA 1050 

TGI GSI TTST QFY YKL 
VLAL AVS QPL LSST TNY 
YWH WQYH NLY SVL LQTI 
TCICAAGAAA TAAAIGGIGA (30G3AACAG GICACIGACT CXDCIQGICAC 1100
SQEI N GD MEQ VTDS LVT 
LKK M V T WNR S L T PWSP 

SRN KW. HGTG H.L PGH 

CTIGCAAGAT GAACTU^ACT C0CI3OIAGC AGTALJILUIT CAAAATOGAA 1150 
LQD Q L N S L A A V V L QNRR 
CKINLT P. QQ -SFKIE 
LARS T.L PSS SSPS KSK 

GAGCITEAGA CTIGCTAA02 GOCAAAAGAG G33GAAGCIG TTEAi'ITl'IA 1200 

ALD LLT AKRG GTC LFL 
EL.T C.P PKE G.E P V Y F . 
SFR LANR QKR GNL FIFR 

GGAGAAGAAC Q L'lUl'JJA TIA TCTIAATCAA TOZAGAATIG TCACIGAGAA 1250 
GEER CYY VNQ SRIV TEK 
EKN AVIM LIN PEL SLRK 
RRT LLL C.SI QNC H.E 

AGTEAAAGAA ATIOGAGATC GAA32£AAIG TAGAGCAGAG GAGCTKAAA 1300 
VKE IRDR IQC RAE ELQN 

LKK FEI EYNV EQR SFK 
S.RN SRS NTM S R G ASK 

ACACCGAACG CIGGGGOCIC CICAGOCAAT QGAIQGOCIG GJI'ICIUXJC 1350 

TER WGL LSQW MPW VLP 
TPNA GAS SAN GCPG FSP 
HRT LGPP QPM DAL GSPL 

T1LTUAGGAC CIUEAGCAGC TCEAATKTIG TIAC'ICUICT TIQ3ACOCIG 1400 
FLGP L A A LIL LLLF GPC 
S.D L.QL . Y C YSS LDPV 
LRT SSS SNIV TPL WTL 
TALLLTJ.TAAC CICLl'lUl'lA AULTIUIUIU TTCCAGAATr GAAQCTUJLAA 1450
IFN LLVK FVS SRI EAVK 
SLT SLL SLSL PEL KL. 
YL.P PC. VCL FQN. SCK 

AQCffiCAGAT QJ1LT12ACAA ATO3AAO0QC A 1481 

LQM VLQ MEP 
SYRW SYK WNP 
ATD GLTN GTP 
TCAAAAT3SA AGAQ LTi'lA G ALTIUL'IAAC 03CX2AAAGA C33333ftAOCT 50
SKSK SFR LAN RQKR GNL 
QNR RALD LLT AKR GGTC 
KIE EL. TC.P PKE GEP 

Gl'l'imTlTl' AQQQGAAGAA HQL'lUi'lSAGT AlUi'lAATCA ATCIQ3AATC 100 
FIF RGRM LLV C.S IWNH 
LFL GEE CC.Y VNQ SGI 
VYF. GKN AVS MLIN LES 

ATIACIGAGA AAGITAAAGA AATTIGAGAT Q3AATATAAT G12AGAG2AGA 150 

Y.E S.R NLRS NIM .SR 
ITEKVKE I.D RI.C RAE 
LLR KLKK FEI E'YN VEQR 

GGA C L Ti U A A AACACIGCAC OCT3333CCT OCICAGOCAA TGGAT3CXXT 200 
GPSK HCT LGP PQPM DAL 
DLQ NTAP WGL LSQ WMPW 
TFK TLH PGA S- SAN GCP 

GGACICIOOC: LT1LT1!A QGA OJ1U1AGCAG CEATAATATT TITACIOITC 250 
DSP LLRT SSS YNI FT. PL 
TLP FLG PLAA IIF LLL 
GLSP S.D L.Q L.YF YSS 

TTTOGACOCT GTAlUi'lCAA CTIDZFIGTT AAGTTTGICT CITOCAGAAT 300 

WTL YLQ LPC. VCL FQN 
FGPC IFN FLV KFVS SRI 
LDP VSST SLL SLS LPEL 

TGAAGCIGEA AAGCEACAAA TAGTICITCA AATQ3AACCC CAGATOCAGT 350 
. SCKATN SSS NGTP DAV 
EAV KLQI VLQ MEP QMQS 
KL. SYK F F K WNP RCS 
CCATCftCIfcA AAICTA003T QGAQXXTIG3 ACgGSOCIGC TASOMOC 400
HD. NLPW TPG PAC T M L 

MTK IYR GPLD RPA RLC 
P.LK STV DPW TGLL DYA 

TCIGAIGTIA ATGACATIGA AGICAOOCCT GCOGAQGAAA TCICAACIGC 450 

. C . . H . SHPS RGN LNC 
SDVN DIE VTP PEEI STA 
LML MTLK SPL PRK SQLH 

ACAACGCCm CTPCNJTCCA ATICACTAG3 AAGCAGTTAG AGCAGTTGIC 500 
TTPT TLQ FSR K Q L E QLS 
QPL LHSN SVG S S . SSCQ 
NPY YTP I Q .. E A V R AVV 

AGCTAAOTC CCCAACAGTA CTIG33TTIT GCIGTIGAGA QQGIGGftCIG 550 
ANL PNST WVF LLR GWTE 
PTS PTV LGFS C.E GGL 
SQPP QQY LGF PVER VD. 

AGAGACAQGA CTAGCIGGAT TTOCTAG3CT GACIAAGAAT CCQ3AAGXT 600 

RQD . L D F L G . L R I PKP 
RDRT S W I S.A D.ES XSL 
ETG LAGF PRL TKN PXAX 

ANCIGGGAAG GIGAOOQCKT CCAICTTIAA ACATGGGQCT TQCAACITftG 650 
XWEG DRI H L . TWGL QLS 
XGK VTAS IFK HGA CNLA 
LGR . P H PSLN MGL AT. 

CKZACAOOCX3 AOCAATCAGA. GAGCICACIA AAAIGCEAAT CAG3CAAAAA 700 
SHP TNQR AH. NAN QAKT 
HTR PIR ELTK M L I RQK 
LTPD QSE SSL KC.S GKN 
CAG3AQG7IAA AGCAATAG02 AATCATCTAT TGOCTGAGAG GACAGCX333A. 750

GGK A I A NHLL PES TAG 
QEVK Q.P IIY CLRA QRE 
RR. SNSQ SSI A.E HSGK 

AQGACAAQGA TIX333&IAIA. AACICAG3ZA TICAAGCCAG CAACAG2AAC 800 
RTRI GI. TQA FKPA TAT 
GQG LGYK LRH SSQ QQQP 
DKD WDI NSGI Q A S NSN 

aXLTJL'JLUQS TOCmCODCA TTGTATGGGA GCICIGTITT CACICEATIT 850 
PFG SPPI VWE LCF HSIS 
PLG PLP LYGS SVF TLF 
PLWV PSH CMG ALFS LYF 

CACICTAITA AATCATGCAA CTGCACICTT CTGGIOCX3IG TITTITATGG 900 

L Y . IMQ LHSS GPC FLW 
HSIK SCN CTL LVRV F.YG 
T L L NHAT ALF WSV FFMA 

CICAAGCIGA QCITTIGTIC GOZATOCACX: ACIGCIGTTT G0CAO33ICA 950 
LKLS FCS PST TAVC HRH 
SS. AFVR HPP LLF ATVT 
QAE LLF AIHH CCL PPS 

CAGACcmrr gcigaci'iuu atcxxttigg atocagcaga gtegtocacig 1000 

RPA ADFH PFG SSR VSTV 
DPL LTS IPLD PAE CPL 
QTRC .LP SLW IQQS VHC 

TGCICCIGAT GCAG0GAG3T ACXXZATTGOC ACIODQGATC AG3CTAAAG3 1050 

L L I QRG THCH SRS G.R 
CS.S SEV PIA TPDQ AKG 
APD PARY PLP LPI RLKA 
CTIGOCATIG TTOOT3CAIG GCmZGPGCC T33J1T1U1U CTAATAGAAC 1100
LAIV PAW LSA WVCP NRT 
LPL FLHG .VP GFV LIEL 
CHC SCM AKCL GLS . . N 

TCAACACIGS TCACIGGGTT OCAIGGTICr CTIGCAIGAC GCACQC3ZCTC 1150 
EHW SLGS MVL FHD PRLL 

NTG HWV PWFS SMT HGF 
• T L V TGF HGS LP.P TAS 

TAATAGAGCT ATAACACICA OOXAIGGOC CAAGATIDZA TIU LTI UAIA 1200 

IEL .HS PHGP RFH SLV 
. . S Y NTH RMA Q DSI PWY 
NRA ITLT AWP KIP FLGI 

TCIGIGAGGC CAAGAAOCXr AGGflCAGAGA AN3IGAGGCT 1GCXZACCATT 1250 
SVRP RTP GQR X.GL PPF 
L.G QEPQ VRE X E A CHHL 
CEA KNP RSEX VRL ATI 

TGGGAAGIGG GCXaCIGOCA TTTIGGTAGC G3XCADCAC CATCTIG3C3A 1300 
GKW PTAI L V A AHH HLGS 
GSG PLP FW.R PTT ILG 
WEVA HCH FGS GPPP SWE 



QCTGEQ33AG CAAGGATOQC OCAGIAACA 

CGS KDP PVT 
A V G A RIP Q. 
LWE QGSP SN 
OCTAGAACCT ATICIQGB3A A1TO33ADCA MGIGACACT OGA03CTAA 50
PRTY SGE L G P M.HS-DAK 
LER ILEN WDQ CDT QTLR 
.NV FWR IGTN VTL R R 

GAAAGAAAGG ATTEATATIC TTCTOCASTA OZO0CIQQQ2 AZAATATOZTT 100 
KET I Y I L LQY RLA TISS 
KKR FIF FCST AWP QYP 
ERND LYS SAV PPGH NIL 

CTTCAMQ3A. GAGAAAOCIG QCTTQCIGAG QGAA3TKTAA ATEATAACAT 150 

SRE RNL AS.G KYK L.H 
LQGR ETW LPE GSIN Y N I 
FKG EKPG FLR E V . IITS 

CA3CTI&CAG CZPGPCCTCT TCTGEAGAAA QGAG33CAAA TQG&3IGAA3 200 
HLTA RPL L . K GGQM E.S 
ILQ LDLF CRK EGK'WSEV 
SYS . T S SVER RAN GVK 

GCAAAJITIL' TITICATIAA GSGACAACIC A^AASTATOT 250 
AIC A N F L FIK RQL TIM. 
PYV QTF FSLR DNS QLC 
CHMC K L S FH. ETTH NYV 

AAAAAjIGIG GTCTAHQOOC TACMGAMC UC1CA2AGIC CAOJIUJL'IA 300 

KVW FMP YRKP SES TSL 
KKCG LCP TGS P QSP PPY 
KSV VYAL Q E A LRV HLPT 

cxiisGasic arriorcGA ciultiujiu aactaaimg csujxxxtp 350 

PQRP LPD S F L N . . G PPF 
PSV PSPT PSS TNK DPPL 
PAS PPR L L P Q LIR TPL 

TAA3XBAAC QTCOCRftMG GEGATA3*£A AA3333EAAA CAATOAACEA 400 
NPN GPKG DRQ RGK Q.TK 

TQT VQK EIDK GVN NEP 
• P K R S K R R . T K G . T MNQ 

AAGSGIQGgl A3KTIULUJJ ATIIA1UJLUJ CIQCAMCTG TGAGAG5fiGG 450 

ECQ YSP IMPP PSS ERR 
KSAN IPR LCP LQAV RGG 
RVP IFPD YAP SKQ . E E E 

A3AAnxTi jc a^ gjC A GaG 'l a xr igia a: Trmciciu TCASAcrraA 500 

RIRP SQS ACT FFSL RLK 
EFG PARV PVP FSL S D L K 
NSA QPE CLYL FLS QT. 
/OCAAATTAA .AATAGADZTA QGTAAATICr CTGAmAXT TGftOSXTAT 550
AN. NRPR . I L R.P .. RLY 
Q I K IDL GKFS DNP DGY 
SKLK .T. VNSQITLTAI 

ATIGKlUlTr TACAAGQSTT AGGACAATOC THGATCTGA CA3GGAGAGA 600 

■ C F TRV RTIL . S D MER 
IDVL QGL GQS FDLT WRD 
LMF YKG. DNP LI. HGEI 

TATAATCTTA CIACTAAATC A3ACACTAAC Q0CMAIGA3 MAAGTT33CG 650 
YNVT TKS D T N P K . E KCR 
I M L LLNQ TLT PNE RSAA 
. C Y Y.I RH.P QMR EVP 

CTGTAACTOC AC3DCX33AGAG TTTO332ATC TTTO3TAICT CAGICA330C 700 
C N C SPRV WRS LVS QSGQ 
VTA ARE FGDL WYL S Q A 
L.LQ PES LAI FGIS.VRP 

AACAAEMGA TGACAACSGA GGAMGAACA ACIQDC&CaG G3iaa3GQC 750 

Q.D DNR GKNN SHR PAG 
NNRM TTE ERT TPTG QQA 
TIG . Q Q R KEQ LPQ ASRQ 

A3ITOCOCT GTftGfiCXXTC ATIG3GSCAC A3AATOGAA CA1G3MMT 800 
SSQC RPS LGH RIRT WRL 
VPS VDPH WDT ESE HGDW 
FPV • T L IGTQ NQN M E I 

O^ILXJL^CAA ACOTTOCTA ALT1UUUIQC TOGAflGSO 1 GB03AAMCr 850 
VPQ TFAN LRA RRT E E N . 
CHK H L L TCVL EGL RKT 
GATN I C . LAC . K D . GKL 

A3SA&GAK2: CmroAATm CICAATOATC TO3CTBTAA CACM33AAA 900 

EEA YEL LNDV HYN TGK 
RKKP MNY SMM STIT QGK 
GRS L.IT Q . C PL. HRER 

G3MGAAAAT CITECIUZTT TICIQGACW3 A2IMGQ3A3 QCMTCTG3A 950 
GRKS YCF SGQ TKGG IEE 
EEN LTAF LDR LRE ALRK 
K K I LLL FW.TD .GR H.G 

MCATftOCTC OCTOICftOCT AA33CXMCT AA lilTIA AAG 1000 

AYL PVT. LY. RPT NLKG 
HTS LSP DSIE GQL ILK 
SIPP CHL TLL KAN. S.R 
GATAfi^riTA TCPCTCPOTC PGCTGCPC^C ATEAGAAAAA ALT1UAAAAG 1050

• V Y HSV SCRH .KK LQK 
DKFI TQS A A D I R K N FKS 
ISL SLSQ LQT LEK TSKV 

TCIGOTTAG QQ003 3 MCA GAACTE&3AA AOOCEAIITTA iUJi'lUJLMC 1100 
SALG PEQ N L E T L F N LAS 
LP. ARSR T.K PYL TWHP 
CLR PGA E LRN PI. LGI 

CICa UTri ' lT TAIAAIJGM AICftGGMGA QCAG30GAAA 003GACAAAC 1150 
SVF YNRD QEE QAK RDKR 
QFF I I E IRRS RRN GTN 
LSFL . R SGG AGET GQT 

Q3GATAAAAA AAAAAGQQ3G OJlOaCEAC TITAGICAIG QOXTCMQC 1200 

DKK KRG G P L L . S W PSG 
GIKK KGG VHY FSH.G PQA 
G.K KKGG STT L V M ALRQ 

AAOCAGAdT IGGAGQdCT QCAAAEGQGA AAAGCTO33C AAATCAAATC 1250 
KQTL EAL QKG KAGQ IKC 
SRI* WRLC KRE K L G KSNA 
ADF GGS AKGK SWA NQM 

0CTAA33O3G CIQGCITXA GIGCGGSCEA C3*M3ACACT TTAAAAAMA 1300 
LIG LASS AVY KDT LKKI 
. . G WLP VRST RTL . K R 
PNRA GFQ CGL QGHF KKD 

M'l'AMt T 'AJVTT AGAAATAMC (Tjrrj* V 1 ' 11 * TOraiGOOOC TEAG3ICAM3 1350 

IQV EIS RPLV HAP YVK 
LSK. K . A APL SMPL TSR 
YPS RNKP PPC PCP LRQG 

0GAA3X3OG GA MQO OCaC TQ0CDCM3G GKIGMGAEA CTCTSGflCA 1400 
GITG RPT APG DEDT LSQ 
ESL EGPL PQG MKI L.VR 
NHW KAH CPRG .RY SES 

GAM03CTTA AOCBGATOKF G£CIGEG3GT 0033303303 1450 

KPL TR.S SSR TEG ARGE 
SH. PDD PAAG LRV PGA 
EAIN QMI QQQ D.GC PGR 

CAIGOCaiCA CIXTCA CMA OLUJULUj ' iA amTlUflOCA 1500 
RQP MPS PSQS PGY V.P 
SASP CHH PHR APGM FDH 
A P A HAIT LTE PRV CLTI 
FIG 6 (continued)
ATQ332AQ2A QQCATCATC& TCMCATCftC TT33IGC03GG 50

MGSS HHH HHH SSGL VPR 

C333C3maT AT03CEAGCA TSRCTQSIGG ACaQCAAATC 100 
GSH MASM TGG QQM GRIL 

TAGASCXJnAT TCIGGAGAAT IQ33AGCAAT GIGACACICA GAQ3CIAAGA 150 
ERI LEN WDQ C DTQ TLR 

AAGAftftCGAT TIAIATTUIT CIGCSGiaDC GO^aSGCfiC AATA3CUICT 200 
KKRF IFF CST AWPQ YPL 

TCAftGGGSGA GAAAQCIGGC TICCIGAG3G ASGIKEAAAT TKTAAQTCA 250 
QGR ETWL PEG SIN YNII 

TCTTAOsGCT AGACCTCTIC TCIBGAAAGG AG33CAAATG GAGIGAAGTG 300 
LQL DLF CRKE GKW'SEV 

C3CATATOIGC AAA CmUlT TTCATT&SGA GfiCAACICAC AATEKIGTAA 350 
PYVQ TFF SLR DNSQ LCK 

AAfiGIGIGGT TTAIGCCCIA CfiGGAAQCEC TC&GAGICCA OITOXTAO: 400 
KCG LCPT GSP QSP PPYP 

CCTOJO'lU-L' CIXXXXXSftCT O-TiLUICAA CI&KEAfiQGA a^JLXJLTl'i A 450 
SVP SPT PSST NKD PPL 

AGCCAAA0S3 TOCRAAAQGA GATBGACAAA G33GEAAACA ATGAAGGAAA 500 
TQTV QKE IDK GVNN EPK 

GSGK3XAAT ATI03333AT TKIGCEECCT CCMOC2CTG 550 
SAN IPRL CPL Q A V RGGE 

AATTO333X OlVlGViaCCTT 'I'l'lL'lL'lL'lC £GACTEAAAG 600 

PGP A R V PVPF SLS DLK 

CAAATI&AAA TSGACETM3 TAAKTTCICA GATftACaTIG PCGECmTKT 650 
QIKI DLG KFS DNPD GYI 

IGATCITITA CAM3Q3TEAG GACAATOCIT TGATCIGACA TGGfiGAGATA 700 
DVL QGLG QSF DLT WRDI 

Ts^romcr joaaatcsg aoctamtc caaatcw»g aagtoccoct 750 

MLL LNQ TLTP NER S A A 



CTAACTQCRG CCOGAlSGIT T33CJ3ATCIT H33i M CI CA GIGAG30CAA 
V T A A REF GDL WYLS Q A N 
FIG 7 (continued)
CAATAQ3ATG ACAACAGAGG AAAGAACAAC TCDC30GGC CAGCAQ32SG 850
NRM TTEE RTT PTG Q Q A V 

TTODCAGIGT AGADOZICAT TG3SACACAG AATCAGAACA TGGAGATCGG 900 
PSV DPH WDTE SEH GDW 

TGOCACAAAC ATTTGCIAAC TIGOGflGCTA GAAQ3ACIGA G3AAAACIAG 950 
CHKH LLT CVL EGLR KTR 

GAAGAAGCCT ATCAATEACT CAATGATGTC CACIATAACA GAG3SAAAG3 1000 
KKP MNYS MMS TIT QGKE 

AAGAAAATCT TACIQCTTTT CIGGACAGAC ATTGAQ3AAG 1050 

ENL TAF LDRL REA LRK 

CATAOTIOGC TGTCAOCIGA ClL'EALL'iGAA GGOCAACTAA TCTTAAAGGA 1100 
HTSL SPD SIE GQLI LKD 

TAAGTITATC ACTCAGTCAG CTOCftGACAT TAGAAAAAAC TTCAAAAGTC 1150 
KFI TQSA ADI RKN FKSL 

TGCCTAAQCT TGO3Q0a3CA CTO3AGCAOC AOCAOCACCA OCACIGAGAT 1200 
PKL AAA LEHH HHH H.D 



0333CIGCTA ACAAAQOC333 AAAGGAAGCT GAGTTGGCIN GTGGCNA 
FIG 8
A lUJL T fl QCR TCACIGGIUG ACfiGCRftAIG OGTCOGAIQC TAGAACGTAT 50
MASM TGG QQM GRIL ERI 

TCIGGAGAAT TQGGAOCAAT GTGACACICA. GAOXTAAGA AAGAAA03AT 100 
LEN WDQC DTQ TLR KKRF 

TEAraii'iL'iT CIGCAGTACC GCCIQ3C3CAC AATATCCTCT TCAAG3GAGA 150 
IFF CST AWPQ YPL QGR 

GAAAOCIG3C TICCIGAGG3 AAGTATAAAT TA1AACATCA TCTEACAGCT 200 
ETWL PEG SIN YNII LQL 

AGAOJlUriL' TCTCAGAAAGG AQQGCAAATC GSGIGASGIG 02AIATCIGC 250 
DLF CRKE GKW SEV PYVQ 

AAACrnCIT TICATEAAGA GACAACICAC AATTATCrlAA AAAGIGIQGT 300 
TFF SLR DNSQ L C K K C G 

ITRTODOCIA TCAGAGICCA GCItmCAGC aZAGOGTCO: 350 

LCPT GSP QSP PPYP SVP 

ciacmscr ul t iuj uja a cisataagsa ccaaocrrm acozaaacgg 400 

SPT PSST NKD PPL TQTV 

TCCAAAAGGA GAIAGACAAA Q333IAAACA ATGAAOCAAA GAGIGOCAAT 450 
QKE IDK GVNN EPK SAN 

ATiamar TATOCGOXT OCAAGCAGIG AGAG3AGSAG AATIO330QC 500 
IPRL CPL Q A V RGGE FGP 

AocxaGBGiG (xiGia ocrr Triuiuiuiu flsenaaa; caaatiaaaa 550 

ARV PVPF SLS D L K QIKI 

TSGSCCESOa TAAATTCICA GATAACGCIG J!£G3CI!ATAT 'lUKiUTlTlA 600 
DLG KFS DNPD GYI DVL 

CAAGGGTIAG GACAAICCIT TCATCIGACA -IQ3AGAGATA. TAATCITACT 650 
QGLG QSF DLT WRDI MLL 

ACIAAATCAG ACECTAACOC CAAATCAGAG AAGTOOCGCT GIAACIGCAG 700 
LNQ TLTP NER SAA V T A A 

CCCGAGAGTT TOGOGAILTI' TOSTTOCICA GICAGGCOA CAA22GGATC 750 
REF GDL WYLS Q AN N R M 



ACAACAGAQ3 AAAGAACAAC TCXTACAG3C CAGCAGGCAG TKHSGTOT 
TTEE RTT PTG QQAV PSV 
FIG 8 (continued)
AGACOCT£AT TGOSftCfiCAG AATCAGAftCA T33AGATTGG TGCCACAAAC 850
DPH WDTE SEH GDW CHKH 

ATTIGCTAAC 'I'lULIJlUL'lA GAAG3ACIGA GGAAAACTAG GAAGAAGCCT 900 
LLT CVL EGLR KTR KKP 

AT3AATTACT CAATGATGTC CACTATAACA CftGQaAAAQG AAGAAAATCT 950 
MNYS MMS TIT QGKE ENL 

TACIGLTl'lT CIG3ACAGAC TAAG33A03C ATIGAG3AAG CATAGCICOC 1000 
TAF LDRL REA LRK HTSL 

TGICAOCIGA CTCTATIGAA GGQC&ACTAA TCITAAAGGA TAAGTTTATC 1050 
SPD SIE GQLI LKD KFI 

ACICAGTCAG CIGCflGACAT TAGAAAAAAC TICAAAAGTC TGGZEAAGCT 1100 
TQSA ADI RKN FK SL PKL 

TGCGQCD3CA CTOSAGCAOT AOGACCACTA GCACIGAGAT CD33CTGCTA 1150 
AAA LEHH HHH H.D PAAN 



ACAAAG3CXX3 AAAGGAAGCT GAGTIG3CIG GIGGCA 
KAR KEA ELAG G 
FIG 9
TCTOOGCTGT GCIC C IGA L LC CAQGACftGQC QQCXZKnQOC TCTOCXZAATT 50
CPLC S.S STG AHCL SQL 
VRC APDP AQA PIA SPNW 
SAV LLI QHRR PLP LPI 

QGGCTAAAQG CITGQCATIG TIXXTQCAGA GCTAAGIQGC T333TDCATC 100 
G.R LAIV PAQ LSA WVHP 
AKG LPL FLHS .VP GFI 
GLKA CHC SCT AKCL GSS 

CHAATGGAGC TGAACACTAG TCACIGGGTT 02A033TICT CTIOZA3GAC 150 
NRA EH. SLGS TVL FHD 

L I E L NTS HWV PRFS SMT 
. SS . T L V TGF HGS LP.P 

CCATGGCITC TAATAGAQCT ATAACACICA CIGC30QC3IC CAAGATTOCA 200 
PWLL I E L .HS LHGP RFH 
HGF . .SY NTH CMV QDSI 
MAS NRA ITLT AWS KIP 

T 1U L T1U G AA TQOGIGAGAC CAAGAAOGGC AQGICAGAGA ACACAAG3CT 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN T R L 

TGOCAOCKIG TTG3AAGCAG OCO^OCAOCA TTTIGGAAGC AGOOOSOCAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKQ PAT 
ATM LEAA HHH FGS SPPL 

TATCT3X3QGA GCICTGGGAG CAAQGAGGCJC AGGTAACAAT TIGGIGAOCA 350 
YLGS SGS KDP R.QF GDH 
I L G A L G A RTP GNN LVTT 
SWE LWE QGPQ VTI W.P 

CX3ft&333AGC TGAATGOGCA AGCAT3AAGG GATCICJCAAA GCAATIQGAA 400 
EGT I R N HEG ISK AIGN 

KGP.ESA TMKG SPK QLE 
RRDL NPQ P.R DLQS NWK 
FIG 9 (continued)
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ATSTICCICC CAAQQCAAAA ATGCD3TTAA GftSiUUKi'lLT QGAGAATIQ3 450 

VPP KAK MPLR CIL ENW 
MFLP RQK CP. DVFW RIG 
CSS QGKN APK MYS GELG 

GACCAATTIG ACCCICAGAC AGTAAGAAAA AAATGACTTA TATICTICIG 500 
DQFD PQT VRK K.LI FFC 
TNL TLRQ E K NDL YSSA 

PI. PSD SKKK MTY ILL 



CAGIACCGCC CIGGCCACGA TATCCTCITC AAGGQGGAGA AACCIQQaCT 550 
STA LATI SSS RGR NLAS 
VPP WPR YPLQ GGE TWP 
QYRP GHD ILF KGEK PGL 

CCIGAG03AA GTATAAATTA TA&CACCATC TEACAGCTAG ACLVLUriTlG 600 

.GK YKL H H L TAR PVL 

PEGS INY NTI LQLD LFC 
LRE V.II TPS YS. TCFV 

TAGAAAAQ3A G3CAAAT33A GIGAAGTGCC ATATITACAA ACITICITTT 650 
. K R R QUE . S A IFTN FLF 
RKG GKWS EVP YLQ TFFS 
EKE ANG VKCH IYK LSF 

CATTAAAAGA CAACICGCAA TTATGTTAAC AGTGTGATTT GIGITCCTAC 700 
I K R Q L AI MLT V.F VFLH 
LKD NSQ LC.Q CDL CSY 
H.KT TRN YVN SVIC VPT 

ACG3AAGCCC TCAGATICTA CECCCCACCC G0Q3CATCIC CXX!IGAATCC 750 

GSP QIL LPTP GIS PES 
TEAL RFY SPP PASP LNP 
RKP SDST PHP RHL P. IP 



CTCCOCAACT TATT 764 
L P N L 
S P T Y 
P Q L I 



FIG 10 
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TCIC03CIGT GCIGCIGATC CAGCACAG3C GCCCKTTGCC TCTOOGAATT 50 
CPLC S.S STG AH C L SQL 
VRC APDP A Q A PIA SPNW 
SAV LLI QHRR PLP LPI 

G33CTAAAGG CTIGCCATIG TTCCTGCNZA QCIAAGIQQC TQ33ITCATC 100 
G.R LAIV PAQ LSA WVHP 
AKG LPL FLHS .VP GFI 
GLKA CHC SCT AKCL GSS 

CTAATCGAGC TGAACACTAG TCACIQGGTT GZACX3GTICT CTIOCATGAC 150 
NRA EH. SLGS TVL FHD 

LIEL NTS HWV PRFS SMT 
.SS . T L V TGF HGS LP.P 

CJCaTGQCTIC TAATAGAGCT ATAACACTCA CIQCATG3IC CAAGATIOCA 200 
PWLL IEL . H S LHGP RFH 
HGF SY NTH CMV QDSI 

MAS NRA ITLT AWS KIP 

TICCITG3AA TOOGIGAGAC CAAGAAOODC AGGTCAGAGA ACACAAG33T 250 
SLE SVRP RTP GQR TQGL 
PWN P.D QEPQ VRE HKA 
FLGI RET KNP RSEN TRL 

TG3ZAQCZATG TTGGAftGCAG GCCAOZAOCA. TITIGGAAQC G300CGXAC 300 

PPC WKQ PTTI LEA ARH 
CHHV GSS PPP FWKR PAT 
ATM LEAA HHH FGS GPPL 

TATCITGQGA GCTCI G3 GAG CAAGGAOCOC CAG3TAACAA TTIGGIGAOC 350 
YLGS SGS KDP QVTI W.P 
ILG ALGA RTP R-Q FGDH 
SWE LWE QGPP GNN LVT 

AGGAAQ33AC CTGAATOCGC AAGCATGAAG GGATCICCAA AGCAATIGGA ' 400 
R R D LNPQ P.R DLQ SNWK 
EGT .IR NHEG ISK A I G 
TKGP ESA TMK GSPK QLE 
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FIG 10 (continued) 
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A A LLUl' I C C IC CCAAGG2AAA. AATGCCCCIA AGAIGEATIC TOG&G&ATIG 450 

CSS QGK NAPK M Y S GEL 
NVPP KAK MPL R C I L ENW 
MFL PRQK CP. DVF WRIG 

GGAQCAATCT GACCCICAGA CAGTAAGAAA AAA&ATGACT TATATlLTiL" 500 
GPI. PSDSKKKNDLYSS 
DQS DPQT VRK KMT YILL 
TNL TLR Q.EK K.L IFF 

T3CAGTACCG CCTG3CCACG GATATCCTCT TCAAG33GGA GAAACCTG3C 550 
AVP PGHG YPL QGG ETWP 
QYR L A T DILF KGE KPG 
CSTA WPR ISS S ' R G R NLA 

CTCCTGAQ33 A&GIAIAAAT TATAACACCA TCITACAGCT AGACCTGTIT 600 

PEG SIN YNTI LQL DLF 
LLRE V.I ITP SYS. TCF 
S.G KYKL .HHLTARPVL 

T3IBGAAAAG GAG3CAAATC GAGIGAAGIG 03flS^FITAC AAALTi'lULT 650 
CRKG GKW SEV PYLQ TFF 
VEK EANG VKC HIY KLSF 
.KR RQM E.SA IFT NFL 

TICATI5AAA GACAACICGC AAT33OT3IAA ACAGTGTGAT TlUlUlUL'iA 700 
SLK D N S Q LCK QCD LCPT 
H.K TTR NYVN SVI CVL 
FIKR QLA I M . TV.F VSY 

CAGGAAGCCC TCAGATCI&C CDCCCT^CCC CGQCKCCTCC CT3ACICCTT 750 

GSPQIYLPTPASP . L L 
QEAL RST SLP RHLP DSF 
RKP SDLP PYP GIS LTPS 

CCCCAACIAA TAAGGACCCA CTICAGCCCA AACAGICCAA AAGGACATftG ' 800 
PQLI RTH FSP NSPK GH 
PN._.GPT SAQ TVQ KDI 
PTN KDP LQPK QSK RT. 
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FIG 11 
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Q3CKTTGATA GCACCCATCA GA3Q3CCAAA TCftLL'UALLTiA CT3GACGAOG 50 
GIDS THQ MAK SLFT GPG 
A L I APIR WPN HYL LDQA 
H. . HPS DGQI I I Y WTR 

OJI'ITI CAAA. ACTATCAAGC AGATAQQ3GC OGflGAAGCAT GGGAAAGAAA 100 
LFK TIKQ IGP VKH AKEI 
FSKLSSR.GP .SMPKK 
PFQN YQA DRA REAC QRN 

TAATOGCC1G COTERTOGCC ATGTICCTIC AGGAGAACAA AGAACAQ3X 150 

IPC L I A MFLQ ENK E Q A 
. SPA LSP CSF RRTK NRP 
NPL PYRH VPS"GEQ RTGH 

ATEACGCAGG GGA&GftCIQG CAACEAGATT TIRDDCftCSff GGQCAAAT3T 200 
ITQG KTG N.I LPTW PNV 
LPR GRLA TRF YPH GQMS 
YPG EDW QLDF THM AKC 

CaOGGRTnC AGCMCEACT AGICIGGGCA GATACTTICA CIQGTIGGGT 250 
RDF SIY. SGQ ILS LVGW 
GIS AST SLGR YFH WLG 
QGFQ HLL VWA DTFT GWV 

GGaGICTICT GCTIGTAGGA CAGAAAAGAC CGAAGAGGTA ATAAAQ3CAC 300 

SLL LVG QKRP KR. .RH 
GVFS L.D RKD PRGN KGT 
ESS PCRT EKT QEV IKAL 

TAATCAAATA ATICCCAGAT TIGGACTICC CnZAOGATIA CAGG3IGACA 350 
. . N N SQI WTS PRIT G.Q 
NEI IPRF GLP PGL QGDN 
MK. FPD LDFP QDY RVT 
FIG 11 (continued)
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AIX33QOCOGC TTICAAGQCT GCAGTCAADOC AQ33AGTATC OGAQ JIUI'IA 400 
WPR FQGC S N P GSI PGVR 
GPA FKA AVTQ GVS QVL 
MAPL SRL Q.P REYP RC. 

GGCATACAAT ATCACITACA CIGIGOCIQG AG30CACAAT CCTCCAGAAA 450 

HTI SLT LCLE ATI LQK 
GIQY HLH CAW RPQS SRK 
AYN ITYT VPG GHN PPEK 

AGTCAAGAAA ATGAATGAAA CACICAAAGA TCTAAAAAAG CTAACCCAAG 500 
SQEN E.N TQR SKKA NPR 
VKK MNET LKD Xi K K LTQE 
SRK .MKHSKI . K S . P K 

AAAO0CACAT TGCMGAOCT GITCIGTIGC CTATAACCTT ACTAAGAATC 550 
NPH CMTC SVA YNL TKNP 
THI A.P VLLP ITL LRI 
KPTL HDL FCC L.PY . E S 

CATAACTATC CDOCAAAAAG CAQ3ACITAG COCATACGAG ATOCIATAIG 600 

.LS PKK QDLA HTR CYM 
HNYP PKS RT. PIRD AIW 
ITI PQKA GLS PYE MLYG 

GATGGGCTIT CCEAACCAAT GAULT1U1UC TIGACIGAGA AATG3CCAAC 650 
DGLS .PM TLC LTEK WPT 
MAF PNQ. PCA L R NGQL 

WPF LTN DLVL D.E MAN 

TTAGTIGCAG ACATCACCIC CmGCCAAA TATCAACAAG TTCTTAAAAC 700 
. L Q TSPP .PN INK FLKH 
SCR HHL LSQI STS S.N 
LVAD ITS LAK YQQV LKT 
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FIG 11 (continued) 




AOTIGIQaar GAGAGGAG33 AAAGGAACIA TTCCACOTIG 750 
HRE PVP ERRE RNY STL 
ITGN LSP RGG KGTI PPW 
SQG TCPR EEG KEL FHPG 

GTG&CA3G 758 
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Docket No.: 103514 

DECLARATION AND POWER OF ATTORNEY 
UNDER 35 USC §371(c)(4) FOR 
PCT APPLICATION FOR UNITED STATES PATENT 

As a below named inventor, I hereby declare that: 

my residence, post office address and citizenship are as stated below under my name; 

I verily believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought, namely the invention entitled: RETROVIRAL NUCLEIC MATERIAL AND NUCLEOTIDE 
FRAGMENTS, IN PARTICULAR ASSOCIATED WITH MULTIPLE SCLEROSIS AND/OR RHEUMATOID
ARTHRITIS, FOR DIAGNOSTIC, PROPHYLACTIC AND THERAPEUTIC USES described and claimed in
international application number PCT/FR98/01460 filed July 7, 1998.

I have reviewed and understand the contents of the above-identified specification, including the claims, as 
amended by any amendment referred to above. 

I acknowledge the duty to disclose to the Office all information known to me to be material to
patentability as defined in Title 37, Code of Federal Regulations §1.56. 

Under Title 35, U.S. Code §119, the priority benefits of the following foreign application(s) filed within
one year prior to my international application are hereby claimed: 

French Patent Application No. 97 08816 filed July 7, 1997

The following application(s) for patent or inventor's certificate on this invention were filed in countries
foreign to the United States of America either (a) more than one year prior to my international application, or (b)
before the filing date of the above-named foreign priority application(s):



I hereby appoint the following as my attorneys of record with full power of substitution and revocation to
prosecute this application and to transact all business in the Patent Office: 

James A. Oliff, Reg, No. 27.075 : William P. Berridge, Reg, No. 2QJ124; 
Kirk M. Hudson, Reg. No. TL5&; Thomas J. Pardini, Reg. No,3£MU; 
Edward P. Walker, Reg. No.J&450; Robert A. Miller, Reg. No. 32JZ1; 
Mario A. Costantino, Reg. No. 33^65; and Caroline D. Dennison, Reg. No. 34,494. 

ALL CORRESPONDENCE IN CONNECTION WITH THIS APPLICATION SHOULD BE SENT TO OLIFF &
BERRIDGE, PLC, P.O. BOX 19928, ALEXANDRIA, VIRGINIA 22320, TELEPHONE (703) 836-6400.

I hereby declare that I have reviewed and understand the contents of this Declaration, and that all 
statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements 
and the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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SEQUENCE LISTING 



(2) INFORMATION FOR SEQ ID NO : 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
GACTCGCTGC AGATCGATTT TTTTTTTTTT TTTT 
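
Each entry in this listing declares a length under (A) and then prints the sequence in groups of ten. A minimal sketch of how the declared length could be checked against the printed sequence, shown here for SEQ ID NO: 68; the helper name `sequence_length` is hypothetical and not part of the application:

```python
def sequence_length(listing_line: str) -> int:
    """Count sequence letters in a listing line.

    Spaces between ten-letter groups and any trailing position
    number (digits) are ignored, since only letters are counted.
    """
    return sum(1 for ch in listing_line if ch.isalpha())


# SEQ ID NO: 68 as printed in the listing, declared as 34 base pairs.
seq_68 = "GACTCGCTGC AGATCGATTT TTTTTTTTTT TTTT"
declared_length = 34

print(sequence_length(seq_68) == declared_length)
```

The same check applies to lines that carry a running position number at the end, because digits are not counted.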

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GCCATCAAGC CACCCAAGAA CTCTTAACTT 

(2) INFORMATION FOR SEQ ID NO : 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CCAATAGCCA GACCATTATA TACACTAATT 

(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 310 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

REPLACEMENT SHEET (RULE 26) 
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

GCTTATAGAA GGACCCCTAG TATGGGGTAA TCCCCTCTGG GAAACCAAGC CCCACTACTC 60
ACCAGCAAAA ATAGAATAGG AAACCTCACA AGGACATACT TTCCTCCCCT CCACATGGCT 120
AGCCACTGAG CAAGCAAAAA TACTTTCACC TGCAGCTAAC CAACAGAAAT TACTTAAAAC 180
CCTTCACCAA ACCTTCCACT TAGGCATTGA TAGCACCCAT CAGATGGCCA AATTATTATT 240
TACTGGACCA GGCCTTTTCA AAACTATCAA GAAGATAGTC AGGGGCTGTG AAGTGTGCCA 300
AAGAAATAAT 310
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
Leu Ile Glu Gly Pro Leu Val Trp Gly Asn Pro Leu Trp Glu Thr Lys
1 5 10 15

Pro Gln Tyr Ser Ala Gly Lys Ile Glu Xaa Glu Thr Ser Gln Gly His
20 25 30

Thr Phe Leu Pro Ser Arg Trp Leu Ala Thr Glu Glu Gly Lys Ile Leu
35 40 45

Ser Pro Ala Ala Asn Gln Gln Lys Leu Leu Lys Thr Leu His Gln Thr
50 55 60

Phe His Leu Gly Ile Asp Ser Thr His Gln Met Ala Lys Leu Leu Phe
65 70 75 80

Thr Gly Pro Gly Leu Phe Lys Thr Ile Lys Lys Ile Val Arg Gly Cys
85 90 95

Glu Val Cys Gln Arg Asn Asn
100
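
The first row of SEQ ID NO: 113 appears to correspond to the second reading frame of the first printed line of SEQ ID NO: 112. A minimal sketch of that translation, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 50 bases of SEQ ID NO: 112 are used:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard table applies to these sequences).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate DNA to one-letter amino acids starting at `offset`.

    Whitespace is stripped first; codons containing letters outside
    A/C/G/T are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 50 bases of SEQ ID NO: 112 as printed in the listing.
seq_112_start = "GCTTATAGAA GGACCCCTAG TATGGGGTAA TCCCCTCTGG GAAACCAAGC"

# offset=1 selects the second reading frame.
print(translate(seq_112_start, offset=1))
```

In one-letter code the result reads LIEGPLVWGNPLWETK, which matches residues 1 to 16 of SEQ ID NO: 113 (Leu Ile Glu Gly Pro Leu Val Trp Gly Asn Pro Leu Trp Glu Thr Lys).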

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
CCCTGTATCT TTAACCTCCT TGTTAAGTTT GTCTCTTCCA GAATCAAAAC TGTAAAACTA 60

CAAATTGTTC TTCAAATGGA GCACCAGATG GAGTCCATGA CTAAGATCCA CCGTGGACCC 120

CTGGACCGGC CTGCTAGCCC ATGCTCCGAT GTTAATGACA TTGAAGGCAC CCCTCCCGAG 180

CAAATCTCAA CTGCACAACC CCTACTATGC CCCAATTCAG CGGGAAGCAG TTAGAGCCGT 240

CATCAGCCAA CCTCCCCAAC AGCACTTGGG TTTTCCTCTT GAGAGGGGGG ACTGAGAGAC 300

AGGACTAGCT GGATTTCCTA GGCCAACGAA GAATCCCTAA GCCTAGCTGG GAAGGTGACT 360

GCATCCACCT CTAAACATGG GGCTTCCAAC TTAGCTCACA CCCGACCAAT CAGAGAGCTC 420

ACTAAAATGC TAATTACGCA AAAATAGGAG GTAAAGAAAT AGCCAATCAT CTATTGCCTG 480

AGAGCACAGC GGGAGGGACA AGGATCGGGA TATAAACCCA GGCATTCGAG CCCGCAACCG 540

CAACCCCCTT TGGCTCCCCT CCCTTTGTAT GGGCGCTCTG TTTTCACTCT ATTTCACXCT 600

ATTAAATCTT GCAACTGAAA AAAAAAAAAA AAAAA 635

(2) INFORMATION FOR SEQ ID NO: 115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
Pro Cys Ile Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg Ile Lys
1 5 10 15

Thr Val Lys Leu Gln Ile Val Leu Gln Met Glu His Gln Met Glu Ser
20 25 30

Met Thr Lys Ile His Arg Gly Pro Leu Asp Arg Pro Ala Ser Pro Cys
35 40 45

Ser Asp Val Asn Asp Ile Glu Gly Thr Pro Pro Glu Glu Ile Ser Thr
50 55 60

Ala Gln Pro Leu Leu Cys Pro Asn Ser Ala Gly Ser Ser
65 70 75
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(2) INFORMATION FOR SEQ ID NO : 116: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
TGGGGTTCCA TTTGTAAGAC CATCTGTAGC TT 32 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
ATGCCCCTCC CTTATCATAC TTTTCTCTTT ACTGTTCTCT TACCCCCTTT CCCTCTCACT 60 
CCACCCCCTC CATGCTGCTC TACAACCACT AGCTCCCCTT ACCAAGAGTT TCTATGAAGA 120 
ACGCGCCTTC CTGGAAATAT TGATGCCCCA TCATATAGGA GTTTATCXAA GGGAAACTCC 180 
ACCTTCACTG CCCACACCCA TATCCCCCGC AACTGCTATA ACTCTCCCAC TCTTTCCATG 240 
CATGCAAATA CTCATTATTG GACAGGGAAA ATGATTAATC CTAGTTGTCC TGGAGGACTT 300 
GGAGCCACTG TCTGTTGGAC TTACTTCACC CATACCAGTA TGTCTGATGG GGGTGGAATT 360 
CAAGGTCAGG CAAGAGAAAA ACAAGTAAAG GAAGCAATCT CCCAACTGAC CCGGGGACAT 420 
AGCACCCCTA CCCCCTACAA AGGACTAGTT CTCTCAAAAC TACATGAAAC CCTCCGTACC 480 
CATACTCGCC TGGTGAGCCT ATTTAATACC ACCCTCACTC CCCTCCATGA GGTCTCACCC 540
CAAAACCCTA CTAACTGTTG GATGTCCCTC CCCCTGCACT TCAGGCCATA CATTTCAATC 600
CCTGTTCCTG AACAATGGAA CAACTTCAGC ACAGAAATAA ACACCACTTC CGTTTTAGTA 660
GGACCTCTTC TTTCCAATCT GGAAATAACC CATACCTCAA ACCTCACCTG TGTAAAATTT 720 
AGCAATACTA TAGACACAAC CAGCTCCCAA TGCATCAGGT GGGTAACACC TCCCACACGA 780 
ATAGTCTGCC TACCCTCAGG AATATTTTTT GTCTGTGGTA CCTCAGCCTA XCATTGTTTG 840 
AATGGCTCTT CAGAATCTAT GTGCTTCCTC TCATTCTTAG TGCCCCCTAT GACCATCTAC 900 
ACTGAACAAG ATTTATACAA TCATCTCGTA CCTAAGCCCC ACAACAAAAG AGTACCCATT 960 
CTTCCTTTTG TTATCAGACC AGGACTCCTA GCCAGACTAG GTACTGGCAT TGGCAGTATC 1020 

ACAACCTCTA CTCAGTTCTA CTACAAACTA TCTCAAGAAA TAAATGGTGA CATCGAACAG 1080
GTCACTCACT CCCTGGTCAC CTTGCAAGAT CAACTTAACT CCCTAGCAGC AGTAGTCCTT 1140
CAAAATCGAA GAGCTTTAGA CTTGCTAACC GCCAAAAGAG CGGGAACCTG TTTATTTTTA 1200
GGAGAAGAAC GCTGTTATTA TGTTAATCAA TCCAGAATTG TCACTGAGAA AGTTAAAGAA 1260
ATTCCACATC GAATACAATC TAGAGCAGAG GAGCTTCAAA ACACCGAACG CTGGCGCCTC 1320
CTCAGCCAAT GGATGCCCTG GGTTCTCCCC TTCTTAGGAC CTCTAGCAGC TCTAATATTG 1380
TTACTCCTCT TTGGACCCTG TATCTTTAAC CTCCTTGTTA AGTTTGTCTC TTCCAGAATT 1440
GAAGCTGTAA AGCTACACAT GGTCTTACAA ATGGAACCCC A 1481



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 493 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

Met Ala Leu Pro Tyr His Thr Phe Leu Phe Thr Val Leu Leu Pro Pro
1 5 10 15

Phe Ala Leu Thr Ala Pro Pro Pro Cys Cys Cys Thr Thr Ser Ser Ser
20 25 30

Pro Tyr Gln Glu Phe Leu Xaa Arg Thr Arg Leu Pro Gly Asn Ile Asp
35 40 45

Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Asn Ser Thr Phe Thr Ala
50 55 60

His Thr His Met Pro Arg Asn Cys Tyr Asn Ser Ala Thr Leu Cys Met
65 70 75 80

His Ala Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro Ser Cys
85 90 95

Pro Gly Gly Leu Gly Ala Thr Val Cys Trp Thr Tyr Phe Thr His Thr
100 105 110

Ser Met Ser Asp Gly Gly Gly Ile Gln Gly Gln Ala Arg Glu Lys Gln
115 120 125

Val Lys Glu Ala Ile Ser Gln Leu Thr Arg Gly His Ser Thr Pro Ser
130 135 140

Pro Tyr Lys Gly Leu Val Leu Ser Lys Leu His Glu Thr Leu Arg Thr
145 150 155 160

His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Arg Leu His
165 170 175

Glu Val Ser Ala Gln Asn Pro Thr Asn Cys Trp Met Cys Leu Pro Leu
180 185 190

His Phe Arg Pro Tyr Ile Ser Ile Pro Val Pro Glu Gln Trp Asn Asn
195 200 205

Phe Ser Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro Leu Val
210 215 220

Ser Asn Leu Glu Ile Thr His Thr Ser Asn Leu Thr Cys Val Lys Phe
225 230 235 240

Ser Asn Thr Ile Asp Thr Thr Ser Ser Gln Cys Ile Arg Trp Val Thr
245 250 255

Pro Pro Thr Arg Ile Val Cys Leu Pro Ser Gly Ile Phe Phe Val Cys
260 265 270

Gly Thr Ser Ala Tyr His Cys Leu Asn Gly Ser Ser Glu Ser Met Cys
275 280 285

Phe Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu Gln Asp
290 295 300

Leu Tyr Asn His Val Val Pro Lys Pro His Asn Lys Arg Val Pro Ile
305 310 315 320

Leu Pro Phe Val Ile Arg Ala Gly Val Leu Gly Arg Leu Gly Thr Gly
325 330 335

Ile Gly Ser Ile Thr Thr Ser Thr Gln Phe Tyr Tyr Lys Leu Ser Gln
340 345 350

Glu Ile Asn Gly Asp Met Glu Gln Val Thr Asp Ser Leu Val Thr Leu
355 360 365

Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn Arg Arg
370 375 380

Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr Cys Leu Phe Leu
385 390 395 400

Gly Glu Glu Arg Cys Tyr Tyr Val Asn Gln Ser Arg Ile Val Thr Glu
405 410 415

Lys Val Lys Glu Ile Arg Asp Arg Ile Gln Cys Arg Ala Glu Glu Leu
420 425 430

Gln Asn Thr Glu Arg Trp Gly Leu Leu Ser Gln Trp Met Pro Trp Val
435 440 445

Leu Pro Phe Leu Gly Pro Leu Ala Ala Leu Ile Leu Leu Leu Leu Phe
450 455 460

Gly Pro Cys Ile Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg Ile
465 470 475 480

Glu Ala Val Lys Leu Gln Met Val Leu Gln Met Glu Pro
485 490



(2) INFORMATION FOR SEQ ID NO : 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CG 

(2) INFORMATION FOR SEQ ID NO: 120: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1329 base pairs

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CGCCAAAAGA GGGGGAACCT GTTTATTTTT 60
AGGGGAAGAA TGCTGTTAGT ATGTTAATCA ATCTGGAATC ATTACTGAGA AAGTTAAAGA 120
AATTTGAGAT CGAATATAAT GTAGAGCAGA GGACCTTCAA AACACTGCAC CCTGGGGCCT 180
CCTCAGCCAA TGGATGCCCT GGACTCTCCC CTTCTTAGGA CCTCTAGCAC CTATAATATT 240
TTTACTCCTC TTTGGACCCT GTATCTTCAA CTTCCTTGTT AAGTTTGTCT CTTCCAGAAT 300
TGAACCTCTA AAGCTACAAA TAGTTCTTCA AATGGAACCC CAGATGCAGT CCATGACTAA 360
AATCTACCGT CGACCCCTGG ACCGGCCTGC TAGACTATGC TCTGATGTTA ATGACATTGA 420
AGTCACCCCT CCCGAGGAAA TCTCAACTGC ACAACCCCTA CTACACTCCA ATTCAGTAGG 480
AAGCAGTTAG AGCAGTTGTC AGCCAACCTC CCCAACAGTA CTTGGGTTTT CCTGTTGAGA 540
GGGTGGACTG AGAGACAGGA CTAGCTGGAT TTCCTAGGCT GACTAAGAAT CCCNAAGCCT 600
ANCTGGGAAG GTGACCGCAT CCATCTTTAA ACATGGGGCT TGCAACTTAG CTCACACCCG 660
ACCAATCAGA GAGCTCACTA AAATGCTAAT CAGGCAAAAA CAGGAGGTAA AGCAATAGCC 720
AATCATCTAT TGCCTGAGAG CACAGCGGGA AGGACAAGGA TTGGGATATA AACTCAGGCA 780
TTCAAGCCAG CAACAGCAAC ccccrrrcGc TCCCCTCCCA TTGTATGGGA GCTCTGZTTT 840
CACTCTATTT CACTCTATTA AATCATCCAA CTGCACTCTT CTGGTCCGTG TTTTTTAXGG 900
CTCAAGCTGA CCTTTTGTTC CCCATCCACC ACTGCTGTTT GCCACCGTCA CAGACCCGCT 960
GCTGACTTCC ATCCCTTTGG ATCCAGCAGA GTGTCCACTG TGCTCCTGAT CCAGCGAGGT 1020
ACCCATTGCC ACTCCCGATC AGGCTAAAGG CTTGCCATTG TTCCTGCATG GCTAAGTGCC 1080
TGGGTTTGTC CTAATAGAAC TGAACACTGG TCACTGGGTT CCATGCTTCT CTTCCATGAC 1140
CCACGGCTTC TAATACAGCT ATAACACTCA CCGCATGGCC CAAGATTCCA TTCCTTGGTA 1200
TCTGTGAGGC CAAGAACCCC AGGTCAGAGA ANGTGAGGCT TCCCACCATT TGGGAAGTGG 1260
CCCACTGCCA TTTTGGTAGC GGCCCACCAC CATCTTGGGA GCTGTGGGAG CAAGGATCCC 1320
CCACTAACA 1329



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
Gln Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr
1 5 10 15

Cys Leu Phe Leu Gly Glu Glu Cys Cys Xaa Tyr Val Asn Gln Ser Gly
20 25 30

Ile Ile Thr Glu Lys Val Lys Glu Ile Xaa Asp Arg Ile Xaa Cys Arg
35 40 45

Ala Glu Asp Leu Gln Asn Thr Ala Pro Trp Gly Leu Leu Ser Gln Trp
50 55 60

Met Pro Trp Thr Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile Ile Phe
65 70 75 80

Leu Leu Leu Phe Gly Pro Cys Ile Phe Asn Phe Leu Val Lys Phe Val
85 90 95

Ser Ser Arg Ile Glu Ala Val Lys Leu Gln Ile Val Leu Gln Met Glu
100 105 110

Pro Gln Met Gln Ser Met Thr Lys Ile Tyr Arg Gly Pro Leu Asp Arg
115 120 125

Pro Ala Arg Leu Cys Ser Asp Val Asn Asp Ile Glu Val Thr Pro Pro
130 135 140

Glu Glu Ile Ser Thr Ala Gln Pro Leu Leu His Ser Asn Ser Val Gly
145 150 155 160

Ser Ser

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:
GGCATTGATA GCACCCATCA G 21

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:
CATGTCACCA GGGTGGAATA G 21
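
As printed, SEQ ID NO: 123 reads as the reverse complement of the last 21 bases of SEQ ID NO: 124, which is what would be expected if it served as an antisense primer for that 758-base fragment. A minimal sketch of the comparison; `reverse_complement` is a hypothetical helper, and the primer role is an inference from the sequences rather than a statement in this listing:

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string.

    Whitespace between ten-letter groups is removed first.
    """
    seq = "".join(dna.split()).upper()
    return seq.translate(COMPLEMENT)[::-1]


# SEQ ID NO: 123 and the final printed groups of SEQ ID NO: 124.
seq_123 = "CATGTCACCA GGGTGGAATA G"
seq_124_end = "AAAGGAACTA TTCCACCCTG GTGACATG"

# True when the primer's reverse complement closes the fragment.
print("".join(seq_124_end.split()).endswith(reverse_complement(seq_123)))
```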

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(2) INFORMATION FOR SEQ ID NO: 124: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 758 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

GGCATTGATA CCACCCATCA CATGGCCAAA TCATTATTTA CTGGACCAGG CCTTTTCAAA 60
ACTATCAAGC AGATAGGGCC CGTGAAGCAT GCCAAAGAAA TAATCCCCTG CCTTATCGCC 120
ATGTTCCTTC AGGAGAACAA AGAACAGGCC ATTACCCAGG GGAAGACTGG CAACTAGATT 180
TTACCCACAT GGCCAAATGT CAGGGATTTC ACCATCTACT AGTCTGGGCA GAXACTTTCA 240
CTGGTTGGGT GGAGTCTTCT CCTTGTAGGA CAGAAAAGAC CCAAGAGGTA ATAAAGGCAC 300
TAATGAAATA ATTCCCAGAT TTGGACTTCC CCCAGCATTA CACGCTGACA ATCGCCCCCC 360
TTTCAAGGCT GCAOTAACCC ACGGACTATC CCAGCTCTTA GGCATACAAT ATCACTTACA 420
CTGTCCCTGG AGGCCACAAT CCTCCAGAAA AGTCAAGAAA ATGAATGAAA CACTCAAAGA 480
TCTAAAAAAG CTAACCCAAG AAACCCACAT TGCATGACCT GTTCTGTTGC CTATAACCTT 540
ACTAAGAATC CATAACTATC CCCCAAAAAG CAGGACTTAG CCCATACGAG ATGCTATATG 600
GATGGCCTTT CCTAACCAAT GACCTTGTGC TTGACTGAGA AATGGCCAAC TTAGTTGCAG 660
ACATCACCTC CTTAGCCAAA TATCAACAAG TTCTTAAAAC ATCACAGGGA ACCTGTCCCC 720
GAGAGGAGGG AAAGGAACTA TTCCACCCTG GTGACATG 758



(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:
CGGACATCCA AAGTGATGGG AAACG 25

(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:
GGACAGGAAA GTAAGACTGA GAAGGC 26



(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
CCTAGAACGT ATTCTGGAGA ATTGGG 26 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
TGGCTCTCAA TGGTCAAACA TACCCG 26

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CCTAGAACGT ATTCTGGAGA ATTGGGACCA ATGTGACACT CAGACGCTAA GAAAGAAACG 60
ATTTATATTC TTCTGCAGTA CCGCCTGGCC ACAATATCCT CTTCAAGGGA GAGAAACCTG 120
GCTTCCTGAG GGAAGTAXAA ATTATAACAT CATCTTACAG CTAGACCTCT TCTCTAGAAA 180
GGAGGGCAAA 7GCAGTGAAG TGCCATATCT GCAAACTTTC TTTTCATTAA GAGACAACTC 240
ACAATTATGT AAAAAGTGTG GTTTATGCCC TACAGGAAGC CCTCAGAGTC CACCTCCCTA 300
CCCCAGCGTC CCCTCCCCGA CTCCTTCCTC AACTAATAAG GACCCCCCTT TAACCCAAAC 360
GGTCCAAAAG GAGATAGACA AAGGGG7AAA CAATGAACCA AAGAGTGCCA ATATTCCCCG 420
ATTATGCCCC CTCCAAGCAG TGAGAGGAGG AGAATTCGGC CCAGCCAGAG TGCCTGTACC 480
TTTTTCTCTC TCAGACTTAA AGCAAATTAA AATAGACCTA GGTAAATTCT CAGATAACCC 540
TGACGGCTAT ATTGATGTTT TACAAGGGTT AGGACAATCC TTTGATCTGA CATGGAGAGA 600
TATAATCTTA CTACTAAATC AGACACTAAC CCCAAATGAG AGAAGTGCCG CTGTAACTGC 660
AGCCCGAGAG TTTGGCGATC TTTGGTATCT CAGTCAGGCC AACAATAGGA TGACAACAGA 720
GGAAAGAACA RCTCCCACAG GCCAGCAGGC AGTTCCCAGT GXAGACCCTC ATTGGGACAC 780
AGAAXCAGAA CATGGAGATT GGTGCCACAA ACATTTGCTA ACTTCCGTGC TAGAAGGAC7 840
GAGGAAAACT AGGAAGAACC CTATGAATTA CTCAATGATG TCCACTATAA CACAGGGAAA 900
GGAAGAAAAT CTTACTGCTT TTCTCGACAG ACTAAGCGAG CCATTGAGGA AGCATACCTC 960
CCTGTCACCT GACTCTATTG AAGGCCAACT AATCTTAAAG GATAAGTTTA TCACTCACTC 1020
ACCTCCAGAC ATTAGAAAAA ACTTCAAAAG TCTGCCTTAG GCCCGGAGCA GAACTTAGAA 1080
ACCCTATTTA ACTTGGCATC CTCAGTTTTT TATAATAGAG A7CAGGAGGA GCAGCCGAAA 1140
CGGGACAAAC GGGA7AAAAA AAAAAGGGGG GGTCCACTAC TTTAGTCATG GCCCTCAGGC 1200
AAGCAGACT7 TGGAGGCTCT GCAAAAGGGA AAAGCTGGGC AAATCAAATG CCTAATAGGG 1260
CTGGCTTCCA GTGCGGTCTA CAAGGACACT TTAAAAAAGA TTATCCAAGT AGAAATAAGC 1320
CGCCCCCTTG TCCATGCCCC TTACGTCAAG GGAATCACTC GAAGGCCCAC TGCCCCAGGG 1380
GATGAAGATA CTCTGAGTCA GAAGCCATTA ACCAGATGAT CCAGCAGCAG GACTGAGGGT 1440
GCCCGGGGCG ACCGCCAGCC CATGCCATCA CCCTCACAGA GCCCCGGGTA TGTTTGACCA 1500
TTGAGAGCCA A 1511



(2) INFORMATION FOR SEQ ID NO: 131:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 352 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:
Leu Glu Arg Ile Leu Glu Asn Trp Asp Gln Cys Asp Thr Gln Thr Leu
1 5 10 15

Arg Lys Lys Arg Phe Ile Phe Phe Cys Ser Thr Ala Trp Pro Gln Tyr
20 25 30

Pro Leu Gln Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser Ile Asn Tyr
35 40 45

Asn Ile Ile Leu Gln Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp
50 55 60

Ser Glu Val Pro Tyr Val Gln Thr Phe Phe Ser Leu Arg Asp Asn Ser
65 70 75 80

Gln Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gln Ser
85 90 95

Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn
100 105 110

Lys Asp Pro Pro Leu Thr Gln Thr Val Gln Lys Glu Ile Asp Lys Gly
115 120 125

Val Asn Asn Glu Pro Lys Ser Ala Asn Ile Pro Arg Leu Cys Pro Leu
130 135 140

Gln Ala Val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro
145 150 155 160

Phe Ser Leu Ser Asp Leu Lys Gln Ile Lys Ile Asp Leu Gly Lys Phe
165 170 175

Ser Asp Asn Pro Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu Gly Gln
180 185 190

Ser Phe Asp Leu Thr Trp Arg Asp Ile Met Leu Leu Leu Asn Gln Thr
195 200 205

Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe
210 215 220

Gly Asp Leu Trp Tyr Leu Ser Gln Ala Asn Asn Arg Met Thr Thr Glu
225 230 235 240

Glu Arg Thr Thr Pro Thr Gly Gln Gln Ala Val Pro Ser Val Asp Pro
245 250 255

His Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His Leu
260 265 270

Leu Thr Cys Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met
275 280 285

Asn Tyr Ser Met Met Ser Thr Ile Thr Gln Gly Lys Glu Glu Asn Leu
290 295 300
1 5 10 15

Arg Gly Ser His Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg
20 25 30

Ile Leu Glu Arg Ile Leu Glu Asn Trp Asp Gln Cys Asp Thr Gln Thr
35 40 45

Leu Arg Lys Lys Arg Phe Ile Phe Phe Cys Ser Thr Ala Trp Pro Gln
50 55 60

Tyr Pro Leu Gln Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser Ile Asn
65 70 75 80

Tyr Asn Ile Ile Leu Gln Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys
85 90 95

Trp Ser Glu Val Pro Tyr Val Gln Thr Phe Phe Ser Leu Arg Asp Asn
100 105 110

Ser Gln Leu Cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gln
115 120 125

Ser Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr
130 135 140

Asn Lys Asp Pro Pro Leu Thr Gln Thr Val Gln Lys Glu Ile Asp Lys
145 150 155 160

Gly Val Asn Asn Glu Pro Lys Ser Ala Asn Ile Pro Arg Leu Cys Pro
165 170 175

Leu Gln Ala Val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val
180 185 190

Pro Phe Ser Leu Ser Asp Leu Lys Gln Ile Lys Ile Asp Leu Gly Lys
195 200 205

Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu Gly
210 215 220

Gln Ser Phe Asp Leu Thr Trp Arg Asp Ile Met Leu Leu Leu Asn Gln
225 230 235 240

Thr Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu
245 250 255

Phe Gly Asp Leu Trp Tyr Leu Ser Gln Ala Asn Asn Arg Met Thr Thr
260 265 270

Glu Glu Arg Thr Thr Pro Thr Gly Gln Gln Ala Val Pro Ser Val Asp
275 280 285

Pro His Trp Asp Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His
290 295 300

Leu Leu Thr Cys Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro
305 310 315 320

Met Asn Tyr Ser Met Met Ser Thr Ile Thr Gln Gly Lys Glu Glu Asn
325 330 335

Leu Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr
340 345 350

Ser Leu Ser Pro Asp Ser Ile Glu Gly Gln Leu Ile Leu Lys Asp Lys
355 360 365

Phe Ile Thr Gln Ser Ala Ala Asp Ile Arg Lys Asn Phe Lys Ser Leu
370 375 380

Pro Lys Leu Ala Ala Ala Leu Glu His His His His His His
385 390 395

(2) INFORMATION FOR SEQ ID NO: 137:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 378 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Leu Glu Arg
1 5 10 15

Ile Leu Glu Asn Trp Asp Gln Cys Asp Thr Gln Thr Leu Arg Lys Lys
20 25 30

Arg Phe Ile Phe Phe Cys Ser Thr Ala Trp Pro Gln Tyr Pro Leu Gln
35 40 45

Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser Ile Asn Tyr Asn Ile Ile
50 55 60

Leu Gln Leu Asp Leu Phe Cys Arg Lys Glu Gly Lys Trp Ser Glu Val
65 70 75 80

Pro Tyr Val Gln Thr Phe Phe Ser Leu Arg Asp Asn Ser Gln Leu Cys
85 90 95

Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gln Ser Pro Pro Pro
100 105 110

Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn Lys Asp Pro
115 120 125

Pro Leu Thr Gln Thr Val Gln Lys Glu Ile Asp Lys Gly Val Asn Asn
130 135 140

Glu Pro Lys Ser Ala Asn Ile Pro Arg Leu Cys Pro Leu Gln Ala Val
145 150 155 160

Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu
165 170 175

Ser Asp Leu Lys Gln Ile Lys Ile Asp Leu Gly Lys Phe Ser Asp Asn
180 185 190

Pro Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu Gly Gln Ser Phe Asp
195 200 205

Leu Thr Trp Arg Asp Ile Met Leu Leu Leu Asn Gln Thr Leu Thr Pro
210 215 220

Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu
225 230 235 240

Trp Tyr Leu Ser Gln Ala Asn Asn Arg Met Thr Thr Glu Glu Arg Thr
245 250 255

Thr Pro Thr Gly Gln Gln Ala Val Pro Ser Val Asp Pro His Trp Asp
260 265 270

Thr Glu Ser Glu His Gly Asp Trp Cys His Lys His Leu Leu Thr Cys
275 280 285

Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser
290 295 300

Met Met Ser Thr Ile Thr Gln Gly Lys Glu Glu Asn Leu Thr Ala Phe
305 310 315 320

Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro
325 330 335

Asp Ser Ile Glu Gly Gln Leu Ile Leu Lys Asp Lys Phe Ile Thr Gln
340 345 350

Ser Ala Ala Asp Ile Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala
355 360 365

Ala Ala Leu Glu His His His His His His
370 375

(2) INFORMATION FOR SEQ ID NO: 138:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:
CTTGGAGGGT GCATAACCAG GGAAT 25

(2) INFORMATION FOR SEQ ID NO: 139:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:
TGTCCGCTGT GCTCCTGATC 20

(2) INFORMATION FOR SEQ ID NO: 140:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:
CTATGTCCTT TTGGACTGTT TGGGT 25

(2) INFORMATION FOR SEQ ID NO: 141:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 764 base pairs
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:
TGTCCGCTGT GCTCCTGATC CAGCACAGGC GCCCATTGCC TCTCCCAATT GGGCTAAAGG 60
CTTGCCATTG TTCCTGCACA CCTAAGTCCC TGGGTTCATC CTAATCGAGC TGAACACTAG 120
TCACTGGGTT CCACGGTTCT CTTCCATCAC CCATCCCTTC TAATAGAGCT ATAACACTCA 180
CTCCATCGTC CAAGATTCCA TTCCTTGGAA TCCGTGAGAC CAAGAACCCC AGGTCAGAGA 240
ACACAAGGCT TCCCACCATG TTGGAAGCAG CCCACCACCA TTTTGGAAGC AGCCCGCCAC 300
TATCTTGGGA GCTCTGGGAG CAAGGACCCC AGGTAACAAT TTGGTGACCA CGAAGGGACC 360
TGAATCCGCA ACGATGAAGG GATCTCCAAA GCAATTGGAA ATGTTCCTCC CAAGGCAAAA 420
ATGCCCCTAA GATGTATTCT GGAGAATTGG GACCAATTTG ACCCTCAGAC AGTAAGAAAA 480
AAATGACTTA TATTCTTCTG CAGTACCGCC CTGGCCACGA TATCCTCTTC AAGGGGGAGA 540
AACCTGGCCT CCTGAGGGAA GTATAAATTA TAACACCATC TTACACCTAG ACCTGTTTTG 600
TAGAAAAGGA GGCAAATCGA GTGAAGTGCC ATATTTACAA ACTTTCTTTT CATTAAAAGA 660
CAACTCGCAA TTATGTTAAC AGTGTGATTT GTGTTCCTAC ACGGAAGCCC TCAGATTCTA 720
CTCCCCACCC CCGGCATCTC CCCTGAATCC CTCCCCAACT TATT 764



(2) INFORMATION FOR SEQ ID NO: 142:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 800 base pairs
(B) TYPE: nucleotide
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:



TGTCCCCTGT GCTCCTGATC CAGCACAGGC GCCCATTGCC TCTCCCAATT GGGCTAAAGG 60
CTTGCCATTG TTCCTGCACA GCTAAGTGCC TGGGTTCATC CTAATCGAGC TGAACACTAG 120
TCACTGGGTT CCACGGTTCT CTTCCATCAC CCATGCCTTC TAATAGAGCT ATAACACTCA 180
CTGCATGGTC CAAGATTCCA TTCCTTGGAA TCCGTGAGAC CAAGAACCCC AGGTCAGAGA 240
ACACAAGGCT TCCCACCATG TTGGAAGCAG CCCACCACCA TTTTGGAAGC GGCCCGCCAC 300
TATCTTGGGA GCTCTGGGAG CAAGGACCCC CAGGTAACAA TTTGGTGACC ACGAAGGCAC 360
CTGAATCCGC AACCATGAAG GGATCTCCAA AGCAATTGGA AATGTTCCTC CCAAGGCAAA 420
AATGCCCCTA AGATGTATTC TGGAGAATTG GGACCAATCT GACCCTCAGA CAGTAAGAAA 480
AAAAATGACT TATATTCTTC TGCAGTACCG CCTGGCCACG GATATCCTCT TCAAGGOGGA 540
GAAACCTGGC CTCCTGAGGG AAGTATAAAT TATAACACCA TCTTACAGCT AGACCTGTTT 600
TGTAGAAAAG GAGGCAAATG GAGTGAAGTG CCATATTTAC AAACTTTCTT TTCATTAAAA 660
CACAACTCGC AATTATGTAA ACAGTGTGAT TTGTGTCCTA CAGGAAGCCC TCAGATCTAC 720
CTCCCTACCC CCGCATCTCC CTGACTCCTT CCCCAACTAA TAAGGACCCA CTTCAGCCCA 780
AACAGTCCAA AAGGACATAG 800